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bases. Because of the large amount of data it can handle, com- 
putational linguistics provides valuable contributions to well- 
established fields in the study of the biblical languages, such as 
orthography or syntax. In addition, more advanced computation- 
al techniques, such as author clustering and text-classification, 
provide new ways to approach long-standing problems such as 
source-critical questions and genre distinctions. The application 
of computational linguistics to the biblical languages also touches 
upon fundamental questions about the interpretation of the Bible, 
because it blurs the traditional distinction between the sciences, 
conceived of as a mode of scholarship involving calculation and 
pattern recognition, and the humanities, understood as a herme- 
neutic and critical mode of scholarship. 
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the Hebrew Bible. A complexity approach to linguistic theory 
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accordingly aims to situate a particular subject of study within a 


context that more closely resembles the complex interactions of 


Abstracts and Keywords xxi 


various dimensions of the system. CT is different from the other 
linguistic theories presented in this volume in that it functions as 
a metatheory. An understanding of the general principles of CT 
can mitigate an inclination towards reductionist analysis and 
enable researchers to perceive fundamental properties and inter- 
actions of language phenomena that remain opaque to traditional 
theoretical frameworks. As a metatheory, CT can provide a help- 
ful perspective for the scholar or practitioner, transforming an 
analysis via the questions it raises, the manner in which answers 
to these questions are pursued, and the conclusions ultimately 
drawn. A brief overview of CT is provided with particular atten- 
tion to its application in the field of linguistics. 
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INTRODUCTION 


Elizabeth Robar 


Scholarship on biblical languages is rooted in Classical Philology, 
but Classical Philology has all but disappeared after birthing the 
full range of humanities. The linguistics portion of philology has 
developed several separate schools of linguistic thought, each of 
which offers a different perspective on how to analyse and 
explain language. These different perspectives heavily influence 
how we interpret the biblical text, which places a burden on the 
student and scholar not to remain ignorant about formative 
influences behind their own interpretations. 

This edited volume presents the most important linguistic 
theories in use today for interpreting the biblical languages, with 
the intent of educating the student and scholar about what lies 
behind many of their favourite language tools and resources. All 
too often, both student and scholar are unaware of the prior theo- 
retical commitments that drive many of the conclusions in these 
resources. 

All work is, of necessity, done within a theoretical frame- 
work of some kind. When that framework remains unexamined 
and unarticulated, it becomes a liability for interpretations based 
on it. Ignorance of theoretical frameworks has become perva- 


sive and problematic enough, even in academic circles, that one 
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response is to require that linguistic proposals submitted for con- 
ferences express ‘a well-articulated linguistic method’ in order to 
even be considered for acceptance (Society of Biblical Literature, 
Linguistics and Biblical Hebrew section). 

This volume is a response to the need for students and 
scholars first to understand the various theories on offer, in order 
to be able to decide which theoretical framework is most suitable 


for their own work. The chapters fall into three categories: 


1. Distinct linguistic theories, each with their own presuppo- 
sitions, purpose and methods. Scholars with a strong theo- 
retical background will often identify with one of these in 
particular. 

a. Generative linguistics (Chapter 2) 
b. Functional Grammar (Chapter 3) 
c. Cognitive Linguistics (Chapter 4) 
d. Historical linguistics (Chapter 5) 

2. Practical implementation of a theory, which has been 
applied to each of the above theories. 

a. Computational linguistic analysis (Chapter 6) 

3. Umbrella theory that seeks to enable interaction between 
theories, as a practical form of interdisciplinarity. 


a. Complexity Theory (Chapter 7) 


Significant areas of linguistic theory not covered in this 
volume include structuralism and typological linguistics. The 
influence of these theories on biblical studies has either been 
indirect or very localised. Chapter 3 (Ross) provides an overview 
of structuralism as it bears on generative linguistics and Cogni- 


tive Linguistics. Typological linguistics, in connection to Biblical 
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Hebrew, is best known from John Cook’s application of Bybee, 
Perkins, and Pagliuca (1994) in his Time and the Hebrew Verb 
(2012). 

It should be noted that Chapter 4 is entitled Computational 
Linguistic Analysis rather than Computational Linguistics, because, 
as van Peursen writes, application of computational linguistics 
has been largely restricted to syntactic databases of the biblical 
text, along with a few dissertations on machine learning. The 
term computational linguistics now tends to refer to Natural Lan- 
guage Processing (NLP) and machine learning, which are only in 
their infancy within biblical studies. 

These linguistic theories differ in specific areas. One such 
area is how the nature of language itself is understood. Ferdinand 
de Saussure famously distinguished between la parole (spoken 
language, with all the messiness of everyday interaction) and la 
langue (the abstract system of language, as found in edited writ- 
ing). Spoken language may be broadened to communication, 
whether largely linguistic (Functional Grammar) or embodied 
and extending far beyond language (Cognitive Linguistics). For- 
mal (written) language is understood as grammatical (following 
the rules) or ungrammatical (generative linguistics). 

The nature of grammar flows from the understanding of 
language: it can be inherently meaningful, a key to interpreting 
communication; or it can be a formal system explicating the 
abstract linguistic system. Theories approaching language as an 
abstract system have different purposes for their grammatical 


analysis: the diachronic (historical) development of language 
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(historical linguistics), the synchronic reality (e.g., structural- 
ism), or the delimitation of acceptable (‘grammatical’) construc- 
tions (e.g., generative linguistics). 

While not a focus of this volume, the extent of language 
analysed has also been a significant distinguishing factor between 
theories: whether a theory addresses grammar of linguistic levels 
at the sentence and below (e.g., structuralism, generative linguis- 
tics), or linguistic levels above the sentence (e.g., discourse analy- 
sis), or conceptual representations beyond language itself (e.g., 
Cognitive Linguistics). 

A final distinction between these theories is the priority 
they accord to various linguistic levels: morphology (e.g., struc- 
turalism and historical linguistics), semantics (e.g., Cognitive Lin- 
guistics), syntax (e.g., generative linguistics), and pragmatics 
(e.g., Functional Grammar). 

Complexity Theory stands out for not taking an explicit 
stance on any of these questions, precisely because its purpose is 
to enable different systems to interact while preserving their own 
internal composition. For perhaps each of the theories, theoreti- 
cal clarity, in the form of dogmatic claims, has often come at the 
cost of explanatory adequacy, when confronted with the com- 
plexity of real language and texts. No one theory can explain 
everything. Complexity Theory aims to create space for each sep- 
arate theory both to operate on its own and to interact with other 
theories. 

Each chapter follows the same format: history and develop- 
ment of the theory; key theoretical commitments and major 


concepts; use and contributions in biblical studies to date; and 
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prospects for further study, application, and collaboration. Each 
chapter concludes with both a handful of suggestions for next 
reading (handbooks, introductions, and foundational texts) as 
well as a full bibliography for fuller research. Each chapter is 
therefore self-contained and may be read on its own, but the par- 
allel structure should facilitate comparison and contrast between 
theories, as the reader seeks to evaluate each theory. 

The biblical text has come to us as written language, a 
reduced form of the original communication. The various linguis- 
tic theories have done much to illuminate that text and that com- 
munication. May this volume be one more step in furthering the 


pursuit of studying the text for the sake of understanding. 
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GENERATIVE LINGUISTICS AS A 
THEORETICAL FRAMEWORK FOR THE 
EXPLANATION OF PROBLEMATIC 
CONSTRUCTIONS IN BIBLICAL HEBREW’ 


Jacobus A. Naudé and Cynthia L. Miller-Naudé 


1.0. Introduction 


Do linguistic theories, in general, and generative linguistics, in 
particular, benefit the interpretation of a biblical text in any way 
and, if so, how? This is the question addressed in this chapter. 
In the meaning-making process of textual interpretation, 
linguistic knowledge provides interpreters of the text with 
knowledge of language structure, metalanguage, and methodol- 
ogy through which they can describe and explain problematic 


instances of language use in the text under consideration. The 


' This work is based on research supported in part by the National 
Research Foundation of South Africa (Jacobus A. Naudé UID 85902 and 
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that opinions, findings, and conclusions or recommendations expressed 
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process of linguistic inquiry proceeds by observation, description, 
and explanation of language use. Explanation of language use is 
the stage at which linguists endeavour to establish the underlying 
rules that speakers internalise to construct and use sounds, 
words, and sentences, as well as the ways in which meanings are 
assigned to these units of language in order to communicate 
(Naudé and Miller-Naudé 2020, 15). By developing a consistent 
analysis of the systematic regularities in the language of a text, 
linguistics contributes to interpretation (Traugott and Pratt 1980, 
20). A linguistic reading that is sensitive to the grammatical, 
sociological, and psychological aspects of language (such as 
choices of words and sentence types) possesses conventional 
reverberations and associations for readers (Fowler 1977, 4). We 
will argue that the linguistic knowledge contributed by linguistic 
theories, in particular generative linguistics, is indispensable for 


the text interpretation process. 


2.0. Historical Development and Concepts of 


Generative Linguistics 


2.1. Goal of the Generative Enterprise 


Noam Chomsky (1928-), Institute Professor Emeritus in the 
Department of Linguistics and Philosophy at the Massachusetts 
Institute of Technology, introduced generative grammar with his 
book Syntactic Structures (1957). The main purpose of the genera- 
tive enterprise is to suggest an explanatory hypothesis concerning 
the nature of language and ultimately human thought (Chomsky 
1982a, 5-58; 2019, 265-66). Accordingly, the object of linguistic 
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study from a generative point of view is the knowledge that 
native speakers possess that enables them to produce and under- 
stand sentences. A generative grammar is thus an analysis of the 
mental mechanisms that enumerate all the grammatical sen- 
tences of a language—and only those that are grammatical—and 
assign to each an appropriate structural description (Chomsky 
1965, 4-5). Chomsky (1986a, 3; 1991a, 6) formulated three basic 
questions that frame this inquiry: (1) what constitutes knowledge 
of language, (2) how is such knowledge acquired, and (3) how is 
such knowledge put to use? 

Concerning the first question, the internalised knowledge 
of speakers is called ‘language competence’ (Chomsky 1965, 3- 
9). It refers to the knowledge of finite sounds and rules of pho- 
nology, finite words and rules for the formation of meaningful 
words, as well as finite rules for the production of an infinite 
number of sentences and their meanings. The notion of language 
as generative (i.e., language as a system that provides for infinite 
use of finite means) lies at the heart of generative linguistics and 
provides a solution to Von Humboldt’s problem of what consti- 
tutes knowledge of language (Chomsky 1991a, 6-14). 

The second problem is known as ‘Plato’s problem’: how is 
it possible that children acquire language so early, effortlessly, 
and with so little experience with language data, in comparison 
to the acquisition of number systems and writing systems, which 
take many years to learn? Furthermore, how is this rich system 
of language knowledge shared (Chomsky 1986a, 51-220; 1991a, 
15-17)? Chomsky proposed that the innate component of the lan- 


guage faculty as represented in the mind/brain makes early child 
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language acquisition effortless. This biologically innate language 
faculty (or Universal Grammar, UG) helps children to make sense 
of the language data to which they are exposed and to build an 
internal grammar (I[nternal]-language; Chomsky 1986a, 21-24), 
which is used to produce the sentences the children utter 
(E[xternal]-language; Chomsky 1986a, 19-21). 

The third question concerns the individual’s use of lan- 
guage knowledge in acts of communication—namely speech pro- 
duction and speech perception—which Chomsky (1965, 10-15) 
called ‘language performance’. In this regard, Descartes and his 
followers made three observations (Chomsky 1972a, 5-14). 
Firstly, normal use of language is innovative; utterances are 
mostly new and not a repetition or even similar in pattern to 
previous utterances. This is similar to Chomsky’s (1957, 15) view 
that a speaker is able to understand/interpret and produce an 
infinite number of new, previously unheard utterances. Secondly, 
humans do not have to communicate in response to stimuli or 
instinct; humans are free to think and express themselves at will. 
This is similar to Chomsky’s (2002 [1966], 52-62) view that 
normal use of language is free from stimulus control and can 
therefore serve as an instrument of thought and self-expression. 
Thirdly, the normal use of language is coherent and appropriate 
to the situation. This creative aspect of language use provides an 
answer to Descartes’s problem of how knowledge of language is 
put to use (Chomsky 2002 [1966], 51-71; 1991a, 15-19). One 
can explain the complex creative linguistic behaviour of humans 
only by concluding that it is determined by intrinsic properties 


of mental organisation. This happens as an internal grammar 
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(I-language) generates structural descriptions or representations 
that specify the linguistic elements for each expression (E- 
language; Chomsky 1964c, 7-9; 2002 [1966], 72-93; 1991a, 17- 
20). 

The goal of generative linguistics is therefore “to construct 
a formalized general theory of linguistic structure” to account for 
these states of affairs and provide solutions for many other 
related problems for which the theory was not actually designed 
(Chomsky 1957, 5). To achieve this goal, focus is specifically on 
“syntactic structure, both in the broad sense (as opposed to 
semantics) and the narrow sense (as opposed to phonemics and 
morphology)” (Chomsky 1957, 5). Although generative linguists 
use deduction and intuition to construct theories of language 


structure, they test them against actual language data. 


2.2. Autonomy of Syntax 


Chomsky’s Syntactic Structures (1957) introduced the theoretical 
study of syntax. To derive sentences, Chomsky (1957, 45-46) 
formalised a tripartite arrangement of grammar, consisting 
of phrase structure, transformational structure, and morpho- 
phonemics. The output of phrase structure is a sequence of mor- 
phemes, though not necessarily in the correct order. The rules of 
the transformational structure may rearrange, add, or delete mor- 
phemes to yield a string of words. The morphophonemic rules 
convert this string of words into a string of phonemes. In short, 
to encode dependencies between sentence parts (word order), a 
system of phrase structure rules produces basic sentences, while 


transformational rules derive all other sentences from these basic 
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sentences. For example, active sentences are considered basic, 
whereas passive sentences are derived from active sentences 
by means of transformational rules (Chomsky 1957, 42-43). 
Accordingly, transformations must involve important semantic 
consequences. 

Although Chomsky (1957, 13-17, 92-105) considered the 
question of how the meaning of a sentence is related to its syn- 
tactic form, his conclusion was that grammar is “best formulated 
as a self-contained study independent of semantics” (106). To 
avoid misunderstanding of the nature of the relationship between 
syntax and semantics, this conclusion must be understood in light 
of the following statement by Chomsky (1957, 102): 

We can judge formal theories in terms of their ability to 

explain and clarify a variety of facts about the way in 

which sentences are used and understood. In other words, 

we should like the syntactic framework of the language 

that is isolated and exhibited by the grammar to be able to 

support semantic description, and we shall naturally rate 

more highly a theory of formal structure that leads to 

grammars that meet this requirement more fully. 

Instead of viewing the emergence of the theory of linguistic 
structure as a succession of different models with numerous mod- 
ifications, it must rather be viewed as a single model that is con- 
tinually subject to critical assessments in terms of new questions 
and insights, and that is accordingly modified (Van Riemsdijk 
and Williams 1986, 171). The various terms used to designate the 
model are used to refer to the respective periods in the history of 
generative linguistics over nearly seven decades. The following 


sections provide an overview of these respective periods. 
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2.3. Standard Theory Model 


The success of a linguistic theory is tested for adequacy by the 
degree to which the structures generated by the syntactic rules 
form a suitable basis for semantic interpretation. In 1962, 
Chomsky (1964a, 936; 1964b, 77; 1964c, 51) considered the 
incorporation of semantics into generative grammar: 
In general, as syntactic description becomes deeper, what 
appear to be semantic questions fall increasingly within its 
scope; and it is not entirely obvious whether or where one 
can draw a natural bound between grammar and ‘logical 
grammar’, in the sense of Wittgenstein and the Oxford 
philosophers. Nevertheless, it seems clear that explanatory 
adequacy for descriptive semantics requires, beyond this, 
the development of an independent semantic theory 
(analogous, perhaps, to the theory of universal phonetics 
as mentioned below) that deals with questions of a kind 
that can scarcely be coherently formulated today, in 
particular, with the question: what are the substantive and 
formal constraints on systems of concepts that are con- 
structed by humans on the basis of presented data? 


This role of semantics was addressed by Katz and Fodor (1963) 
and Katz and Postal (1964; see Chomsky 1964c, 14). In the same 
way that Chomsky (1957) wanted to make explicit what the 
speaker knows of syntactic structure, they wanted to make 
explicit what the speaker knows of the meanings of words and 
sentences. These proposals led to the incorporation of semantics 
into the model known as the Standard Theory (ST; Chomsky 
1969, 5), which was described by Chomsky (1965, 132-36). 
The ST exhibits a triangular organisation containing a 


syntactic component and two interpretive components, one 
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phonological and one semantic (Chomsky 1964c, 9-10; 1965, 
15-18; Katz and Postal 1964, 161). Chomsky (1965, 135) puts 
forward the hypothesis that semantic interpretation is made only 
in the underlying or deep structure. Specifically, the syntactic 
component consists of a base sub-component and a transforma- 
tional sub-component (Chomsky 1965, 106-11, 141). The former 
generates a deep structure where words from the lexicon are 
inserted into a preterminal string of dummy elements of the 
categorial part (Chomsky 1965, 120), which is submitted to the 
semantic component for semantic interpretation. Without chang- 
ing meaning, the transformational sub-component maps deep 
structure into surface structure by reordering elements (Chomsky 
1965, 123-24), which enter the phonological component for 
phonetic interpretation by the rules for pronunciation (Chomsky 
1965, 135). 

The interpretive components play no part in the recursive 
generation of sentence structures (Chomsky 1965, 141). Syntax 
is self-contained (see Chomsky 1965, 226 n. 15). For the inter- 
pretation of a sentence, the syntactic component of the grammar 
relates a semantic interpretation of the underlying structure to a 
phonetic representation on surface level (Chomsky 1965, 136). 
Chomsky (1965, v—vi) clearly states that this hypothesis is merely 


tentative and emerging. 


2.4. Generative Semantics 


Katz and Postal (1964, 71) argued that semantic interpre- 
tations are uniquely assigned to deep structures, a viewpoint 


also reflected in the ST. This requirement means that the 
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transformational sub-component of the syntactic component does 
not contribute in any way to semantic interpretation. “Trans- 
formations do not affect meaning” became known as the Katz- 
Postal Hypothesis (Chomsky 1965, 132, 135; 1977a, 140-43). 
Katz and Postal (1964, 72-156) defend their hypothesis by re- 
examining apparent counterexamples, for example, the deriva- 
tions of active/passive, negatives, imperatives, questions, etc., 
and claim that “their present formulation is not entirely ade- 
quate” (71). Unsurprisingly, this model, known as ‘interpretive 
semantics’ (Chomsky 1977a, 145), was soon challenged, espe- 
cially because of the exclusive link it postulated between seman- 
tics and deep structure. 

In his exposition of the ST, Chomsky (1965, 224 n. 9) 
already expressed his doubts about the Katz-Postal Hypothesis. 
Chomsky (1977a, 151) mentioned the critique of Ray Jackendoff 
as the first to show that surface structure plays a role in semantic 
interpretation. Chomsky (1972a, 100-14) argued that this 
restriction on the nature of transformations is too strong; there 
are cases where transformations have semantic effects. Both deep 
and surface structure enter into the determination of meaning: 
deep structure inter alia for predication and modification, and 
surface structure for focus, presupposition, topic and comment, 
the scope of logical elements (including quantifiers), and pro- 
nominal reference, a view that eventually led to the Extended 
Standard Theory model (see 82.5). 

George Lakoff laid the foundations for the development of 
Generative Semantics in his dissertation published in 1970. 


Lakoff and others argued that, if the deep component were of a 
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semantic rather than a syntactic nature, and if the difference 
between the semantic component and the deep syntactic compo- 
nent were erased, meaning differences could all be expressed as 
an underlying semantic representation. Consequently, the inde- 
pendent deep syntactic component of the ST would cease to exist 
as an independent level of description and become indistinguish- 
able from the semantic level. From 1966 until the early 1970s, 
the theory of Generative Semantics was developed, which 
replaced the base of the ST with an abstract semantic level from 
which meanings were mapped into surface structures by trans- 
formations according to a unilinear structure of input-output 
relations (Lakoff 1968, 1-84; 1971, 232-96). This approach is 
known as the transformationalist position (Chomsky 1972b, 17). 

Chomsky argued that transformational derivation in 
Generative Semantics cannot naturally capture structures that 
exhibit significant syntactic differences, when they are assigned 
identical underlying structures (Chomsky 1966, 48-49; 1972b, 
11-61). Jackendoff (1972, xi) opted for the formulation of proper 
rules of semantic interpretation to account for semantic phenom- 
ena and to leave the syntactic component as free of semantic 
intervention as it was in Syntactic Structures. Katz (1972; 1977) 
continued to pursue the basic approach to semantics as exposed 
in Katz and Fodor (1963). 

Although Generative Semantics was eventually unsuccess- 
ful because of the failure to distinguish between syntactic and 
non-syntactic properties of sentence structure, it served as a cata- 
lyst for the further study of semantics (Van Riemsdijk and 
Williams 1986, 88). 
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2.5. Extended Standard Theory Model 


In 1972, Chomsky published a more refined theory of semantic 
interpretation. Chomsky (1972b, 11-202) described in three 
essays the shortcomings that arose within the ST, resulting in the 
emergence of the Extended Standard Theory (EST; Chomsky 
1970, 10). In EST, the semantic component does not operate 
solely on the output of the deep syntactic component, but also on 
the output of the transformational component (Chomsky 1972b, 
5). This constitutes the Chomskyan departure from the ST set out 
in Katz and Postal (1964) and Chomsky (1965). In EST, semantic 
representation is shared between the underlying structure (for 
thematic and case structure) and the surface structure (for rules 
of anaphora and quantification; Chomsky 1972b, 62-119). In 
other words, the semantic component relates both the deep struc- 
ture and the surface structure to a semantic representation. 
However, the grammatical relations expressed in a sentence are 
inherent in the syntactic deep structure. A separate semantic 
component with a purely interpretive role is still assumed 
(Chomsky 1972b, 120-202). 

Chomsky (1972b, 11-61) also extends grammatical theory 
to incorporate syntactic features, thus formulating the ‘lexicalist 
position’. He also presents evidence that EST is to be preferred 
over Generative Semantics on methodological as well as empiri- 
cal grounds (62-202). 


2.6. Revised Extended Standard Theory Model 


Chomsky (1975a) proposed further theoretical refinements, in 
particular the Revised Extended Standard Theory (REST), where 
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semantic representation appears only in the surface structure. 
The syntactic and semantic properties of the deep structures of 
the ST and the EST are dissociated (Chomsky 1975a, 81-82). 
Chomsky (1975a, 82) also dropped the term ‘deep structure’, 
utilising instead ‘initial phrase marker’, but retaining the term 
‘surface structure’ (see also Chomsky 1977a, 169-79). This theo- 
retical change was made possible by the introduction of ‘traces’ 
(called ‘copies’ since 1995), which mark the original positions of 
moved elements in the underlying structure (Chomsky 1975a, 
86-103; 1977a, 165). As a result, the rules of thematic and case 
structure were applied to surface structure, thereby simplifying 
the semantic component (Chomsky 1975a, 116-18). 

Accordingly, derivations within the REST model are as 
follows (Chomsky 1975a, 103-5; 1977a, 165). The rules of the 
categorial component (i.e., the lexical categories) and the lexicon 
provide initial phrase markers. The rules of the transformational 
component convert the initial phrase markers to surface struc- 
tures (including traces), which undergo semantic interpretation 
(involving bound anaphora, scope, thematic relations, etc.) to 
convert the surface structures into logical forms (Chomsky 
1977a, 165-66). For fuller representation of meaning, the gene- 
rated logical forms are subject to further interpretation by other 
semantic rules that interact with other cognitive structures. 

The picture which emerged with the REST model suggests 
that the grammar consists of various autonomous components 
(syntax, phonology, semantics), which have their own rules and 
interact with each other only at the relevant interface (Van 
Riemsdijk and Williams 1986, 174). In the description and 
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explanation of a particular language, each of these components 
and their interrelations must come into its own. 

Enriched with idiosyncratic transformational rules, the 
model at this stage faced the problem of how to characterise the 
initial state of the language faculty (to have explanatory ade- 
quacy) and show how it maps language experience to the final 
state attained (to have descriptive adequacy). To achieve explan- 
atory adequacy, a theory of the initial state can allow only limited 
variation. This insight led to a refinement of generative grammar 
with the shift towards the Principles-and-Parameters approach 
(Chomsky 1981a; 1991b, 417), discussed below, where the initial 
state of language is represented as a system of innate formal 
principles with associated open parameters, the values of which 


are fixed by linguistic experience for each specific language. 


2.7. Principles-and-Parameters Approach 


So far the emphasis was on the similarities between languages. 
The question was how to account for the differences between 
languages. Chomsky realised that the apparent complexity and 
variety of languages can be viewed as superficial, the result of 
minor changes in a fixed and invariant system. Accordingly, it is 
possible to attribute only limited variation to the innate language 
faculty (or UG) so that explanatory adequacy can be achieved. 
In a series of publications, Chomsky (1977b; 1977c; 1978; 
1980; Chomsky and Lasnik 1977) dismantled the rich idiosyn- 
cratic transformational rules of the previous models. The rethink- 
ing of many problematic aspects of the theory of grammar during 
this period is discussed in Chomsky (1982a, 61-120). The process 
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resulted in the Pisa Lectures (Chomsky 1981b), which are an 
exposition of the Principles-and-Parameters approach (Chomsky 
1991b, 417) or so-called Government-Binding theory—the prin- 
ciple of ‘government’ relates items to one another; the princi- 
ple of ‘binding’ determines which nominals in a sentence are 
co-referential. Chomsky (1991b, 448 n. 1) depicted the name 
Government-Binding theory as “a misleading term that should 
be abandoned.” The model consists of a subsystem of rules and 
a subsystem of parameterised principles. Chomsky (1982b, 7) 
remarked: 

In the course of this work, there has been a gradual shift 

of focus from the study of rule systems, which have 

increasingly been regarded as impoverished (as we would 

hope to be the case) to the study of systems of principles, 

which appear to occupy a much more central position in 

determining the character and variety of possible human 
languages. 
According to Chomsky (1991b, 417), “a language is not, then, a 
system of rules, but a set of specifications for parameters in an 
invariant system of principles of Universal Grammar (UG).” 

The subsystem of rules is reduced as follows: the notions of 
deep- and surface-structures have been renamed D-structure and 
S-structure, respectively, since their roles are similar but not 
identical. The levels of D-structure and S-structure are mapped 
by the transformational operation of Move a (‘move alpha’), 
where a is a variable, meaning that a structure may be altered 
by movement of one or more elements; independent principles 
determine what can move and where it can move (Chomsky 


1981b, 5). Semantic representation is replaced by the notion 


20 Naudé and Miller-Naudé 


Logical Form (LF) as a level of representation, which is a partial 
representation of the structural meaning of a sentence. Quantifier 
scope, the scope of question-words, and reflexive interpretation 
are examples that are dealt with on this level. Similarly, the pho- 
netic representation is replaced by Phonetic Form (PF; Chomsky 
1981b, 18). 

The subsystem of universal principles consists of bounding 
theory (subjacency), government theory, theta-theory (assign- 
ment of thematic roles), binding theory (co-referentiality of 
nominals), Case theory (assignment of abstract Case), and control 
theory (reference of abstract pronominals; Chomsky 1981b, 5-6). 
Each of these theories comprises a principle or set of principles. 
Each of the principles has a parameter which has to be set as 
plus/minus on the basis of language-specific evidence. Parame- 
tric settings account for variation across languages, as well as for 
language acquisition (Hyams 1986). The rules of a specific lan- 
guage are the principles of UG as parameterised for that specific 
language (Chomsky 1991b, 417). 

The setting of parameters provides the opportunity for a 
new comparative syntax to explain language variation histori- 
cally and cross-linguistically. An example is the null subject or 
pro-drop parameter, which distinguishes languages that do not 
allow a subject pronoun to be omitted (e.g., English) and those 
that do (e.g., Italian; Rizzi 1982, 117-84). Certain properties sys- 
tematically correlate with the null subject property, for example 
that an overt subject can occupy a postverbal position (Chomsky 
1981b, 240-48). In 83.2, the null subject or pro-drop parameter 
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is used to explain the syntactic distribution of independent per- 
sonal pronouns in Biblical Hebrew. 

Generative grammar, and specifically the Principles-and- 
Parameters approach, is couched in traditional grammatical 
terminology. Lexical categories include N(oun), V(erb), A 
(adjective/adverb), and P(reposition); functional/grammatical 
categories include D(eterminer), T(ense), Asp(ect), M(ood), and 
C (complementiser/coordinator). Whereas lexical categories are 
acquired, can be translated, borrowed, and have meaning, func- 
tional grammatical categories do not have lexical meaning, are 
rarely borrowed, and may be contracted or lack stress. All of 
these categories project into phrases (i.e., groups of words that 
belong together), which are named after their most important 
part, the head. Phrases may be formed from lexical categories 
(verb phrase, noun phrase, etc.) and functional/grammatical 
categories (determiner phrase, tense phrase, etc.). 

From 1982 to 1991, Chomsky proposed numerous refine- 
ments to the Principles-and-Parameters approach (see 1982b; 
1986a; 1986b; 1988; Chomsky and Lasnik 1993). The most 
important of these refinements (which directly influenced exis- 
ting syntactic analyses of Biblical Hebrew syntax and have impli- 
cations for further analyses) concerns functional projections 
(Pollock 1989; Abney 1987; Rizzi 1997). These include the VP- 
Internal Subject Hypothesis, which proposes that the thematic 
position of the subject is internal to the VP (verb phrase), as well 
as the Split INFL (Inflection) Hypothesis, which proposes that 
Inflection be decomposed into two separate functional heads, 
AGR (Agreement) and T (Tense; Pollock 1989). These two 
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functional projections have implications for the analysis of the 
infinitive in Biblical Hebrew as well as for verb agreement and 
pro-drop (see 883.2, 3.4, 3.5, and 3.8). A third functional projec- 
tion is the DP (determiner phrase) Hypothesis, which proposes 
that noun phrases are in fact projections of a functional head, the 
determiner (D; Abney 1987). This functional projection has 
implications for the analysis of the quantifier 52, which may 
occur inside or outside of the DP to produce different meanings 
(see §3.6). A fourth innovation is the Split CP (complementiser 
phrase) Hypothesis, which concerns “the fine structure of the left 
periphery” (Rizzi 1997, 281). A central point of departure is that 
the discourse-pragmatic orientation of sentences is expressed 
within the CP domain. Accordingly, the C (complementiser) head 
is divided into four heads, each with its own projection, namely, 
Topic, Focus, Force (overt morphological encoding for declara- 
tives, questions, relatives, etc.), and Finiteness (expresses a dis- 
tinction related to tense and other inflectional specifications). 
This refinement accounts for at least the distribution of the 
wayyiqtol/waw consecutive construction as well as for left 


dislocation and topicalisation in Biblical Hebrew (see 83.3). 


2.8. Minimalist Programme 


Considerations of conceptual naturalness that have some inde- 
pendent plausibility, which the human language faculty should 
be expected to satisfy—such as simplicity, economy, symmetry, 
and nonredundancy (see Chomsky 1975b, 113-28; 1991b, 417- 
54; 2019, 264-65)—led to further refinements in the 1990s with 


the Minimalist Programme, the most recent instantiation of 
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generative syntax. Despite some differences, many of the con- 
cepts developed in the Principles-and-Parameters approach carry 
over to the Minimalist Programme and the challenge is how to 
integrate them (e.g., V[erb]-movement). 

Chomsky introduced the Minimalist Programme in three 
publications (1992; 1994; 1995). In the main source (Chomsky 
1995), the computational system of language (i.e., the syntactic 
component) is assumed to contain only what is necessary to build 
representations that connect meaning to sound (or signs or wri- 
ting); these representations are the same for all languages. In 
other words, the computational system of language serves as 
input to the semantic component, which maps (or converts) the 
syntactic structure into a corresponding semantic representation 
(i.e., a representation of linguistic aspects of its meaning). The 
semantic representation interfaces with systems of thought, 
namely, the Conceptual-Intentional system (responsible for inter- 
pretation and non-linguistic knowledge). The computational sys- 
tem of language serves also as input to the Phonetic Form (PF) 
component, which maps the syntactic structure into a PF 
representation (i.e., a representation that provides a phonetic 
‘spellout’, or pronunciation, of sentences). The PF representation 
interfaces with the Sensory-Motor system (responsible for exter- 
nalising the derivation in speech systems, i.e., providing spoken, 
but also signed or written, representations). The Minimalist 
model for deriving a sentence involves making a selection from 
the lexicon (which contains all the lexical items in a language as 
well as their linguistic properties) and then using the compu- 


tational operation ‘Merge’ to bring these items together from 
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bottom to top. Merge includes what was previously referred to as 
‘move’ (e.g., when the merging element is taken from inside the 
derivation and copied). Refinements of the Minimalist Programme 
followed (Chomsky 1998; 1999). 

Derivations and structural representations have become 
extremely bare in the Minimalist Programme. In recent years, the 
focus has shifted from a rich UG to innate mechanisms that are 
part of more general cognitive principles of organic systems 
(Chomsky 2004; 2005; 2007; 2013; 2015; 2019): 

a) Chomsky (2004; 2005) identifies the factors that are cru- 
cial in the development of language, namely, the previous two 
factors—UG and experience (the input of a specific language)— 
and a new factor, principles of efficient computation (not specific 
to the language faculty), which include the Economy Principles. 

b) Chomsky (2007) follows Borer (1984), where parame- 
ters are seen as choices of feature specifications as the child 
acquires a lexicon, the so-called Borer-Chomsky Conjecture (see 
Baker 2008, 156). The computational system of every language 
is identical, but the parametric choices are lexical and account 
for the variety of languages, as well as, for example, the determi- 
nation of linear order. 

c) Representation by means of trees is common in genera- 
tive grammar, but they are no longer used in the Problems-of- 
Projection approach to phrase/clause structure, according to 
Chomsky (2013; 2015). A derivation is not labelled when it is 
built. The labelling is done when the syntax hands over its com- 
bined sets to the interfaces, the interaction between components 


of grammar. 
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d) Chomsky (2019) reviews some foundational aspects of 
the theory of structure building—essentially, Merge and Label— 
while still accounting for recursive structure, displacement, and 
reconstruction (as the main empirical goals of the Minimalist 
Programme). 

To summarise, Chomsky’s vision of a fixed syntactic com- 
ponent with cross-linguistic variation triggered by differences in 
the lexicon continues as the most productive avenue of research 
in the Minimalist Programme, whose core assumption is that 
grammars are minimally complex systems of optimal design with 
parameters set by features of lexical items. The main innate 
mechanism of UG is seen as Merge, an operation by which two 
constituents are combined to form a single larger constituent 
(e.g., a complement merges first with its head and then the spec- 
ifier of the head is merged to this complex). In other words, a 
derivation starts with a selection from the lexicon and then 
merges these elements from bottom to top. The derivation pro- 
ceeds as follows: a verb phrase (VP), which includes the verb with 
its aspectual information and the arguments with their thematic 
roles; a tense phrase (TP), which connects the VP to information 
on finiteness, tense, agreement, and case; a complementiser 
phrase (CP), which connects the TP to pragmatic information 
(mood, topic, focus) or to another clause. Movement of constitu- 
ents may include movement of topic, focus, and wh-elements 
(that is, a question/interrogative word or relative item, such as 
what, who, which, when, why, how) as well as head-movement of 
the verb (V) and tense (T). 


26 Naudé and Miller-Naudé 


The most productive work using generative grammar to 
explain syntactic constructions in Biblical Hebrew has been done 
in terms of the Principles-and-Parameters approach and the 
refinements in the Minimalist Programme. An overview of these 


contributions will be provided in the following section. 


3.0. Contributions of Generative Linguistics to 
Biblical Hebrew 


3.1. Introduction 


As is clear from the overview of the development of Chomskyan 
generative linguistics in the preceding section, the focus has been 
on the syntactic structure of language. For this reason, the exam- 
ples selected for discussion in this section relate primarily to syn- 
tax. The focus on syntax, however, should not detract from the 
fact that generative linguistics has developed and contributed to 
other linguistic subdisciplines (e.g., phonology, morphology, 
semantics, pragmatics), which have also benefitted Biblical 


Hebrew.? 


? These linguistic subdisciplines include: phonology and morphology 
(e.g., Prince 1975; Rappaport 1984; McCarthy 1985 [1979]; Malone 
1993; Dresher 1994; Churchyard 1999; Coetzee 1999; DeCaen 2003; 
Dresher 2009a; 2009b; Himmelreich and Bat-El Foux 2021); 
phonological aspects of the Masoretic accentual system (e.g., Dresher 
1994; DeCaen and Dresher 2021; Pitcher 2021); and the morphology, 
syntax, and semantics/pragmatics of the Biblical Hebrew verbal system 
and verb phrase (e.g., DeCaen 1995; Hatav 1997; DeCaen 1999; 
Harbour 1999; Pereltsvaig 2002; Hatav 2004; 2006; Holmstedt 2009; 
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3.2. Null Subjects, Constituent Order, and the 


Meaning-Making of Independent Pronouns 


An important typological classification based on the Principles- 
and-Parameters approach concerns the observation that lan- 
guages can be divided into those with grammatically optional sub- 
jects (i.e., null subject languages or pro-drop languages) like 
Hebrew and those with obligatory subjects (i.e., non-null subject 
languages or non-pro-drop languages) like English. In this regard, 
Chomsky (1981b, 65) introduced the ‘Avoid Pronoun Principle’, 
which imposes the choice of a null subject over an overt subject 
pronoun where possible. The availability of null subjects in 
Hebrew correlates with the inflectional richness of agreement on 
the finite verb form (Borer 1989). Naudé (1991a; 1993b; 1994a; 
1994b) demonstrates the specific aspects of null subjects for BH 
and Qumran Hebrew, where the qatal/perfect, yiqtol/imperfect, 
wayyigtol, and waqatal verb forms allow null subjects. Subject 
pronouns that are used with null subject verb forms can be 
utilised only as subject topics (see 83.3 below on topicalisation 
and the differences between Biblical Hebrew and Qumran 
Hebrew in this regard). By contrast, participles functioning as the 
predicate in clauses do not allow null subjects. As a result, an 
overt subject with a participial predicate, as in (1), is not 
‘marked’ for topic or focus; it is neutral because an overt subject 


is grammatically required: 


Hatav 2011; 2017; 2020; Boulet 2021; Cowper and DeCaen 2021; 
Doron 2021; Grasso 2021; Hatav 2021). 
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(1) oyna Tina xyP ay adn nna 

‘Towards midnight I will go out in the midst of the 

Egyptians.’ (Exod. 11.4) 

The ‘Avoid Pronoun Principle’ of the Principles-and- 
Parameters approach and the Economy Principles of the Mini- 
malist Programme explain the distribution of independent sub- 
ject pronouns as opposed to null subjects in BH in a principled 
syntactic way for the first time. With the gatal/perfect and yiqtol/ 
imperfect, the subject pronouns as topics appear only in pre- 
verbal position, but with wayyiqtol and wagqatal verb forms they 
occur only in postverbal position. In the following section, we 
demonstrate that there are syntactic mechanisms that explain 
this asymmetry. 

Government and Binding also provides insight into the syn- 
tax of BH participial forms. For example, the structure of BH 
allows the use of participles in attributive constructions while 
maintaining the verbal characteristics of the verbal form (i.e., its 
verbal valency; e.g., Num. 13.32, pax ANS WIN? AD TAY WR PINA 
xin Maw nox ‘the land which we crossed into it to explore it is 
a land eating its inhabitants’). English, in contrast, does not allow 
attributive participial constructions, but requires that a relative 


clause be used. 


3.3. Non-Canonical Constituent Order and Meaning- 


Making at Sentence Edges 


From its conception, one of the central concerns of generative 
linguistics has been to characterise the positions in which nom- 
inal elements can appear in a specific language, that is, word 


order (both canonical and non-canonical positions). In early 
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forms of generative linguistics, this was achieved by a transfor- 
mation (see §2.7) in Principles-and-Parameters and by ‘Merge’ in 
the Minimalist Programme, which also entails movement. These 
theoretical advances made it possible to formally identify and 
differentiate the constituents that occur at the edges of sentences, 
a necessary prerequisite for determining what each construction 
contributes to meaning.? 

A syntactic construction involving constituents moved to 
non-canonical positions at the initial sentence periphery is topi- 
calisation. This construction was described in Naudé (1994a) as 
involving movement to the preverbal topic position: 

(2) janingnsy Avy Agra 

‘Like a whore should he treat our sister jixe a whore?’ (Gen. 

34.31) 
In this example, the topicalised constituent (underlined) is a 
prepositional phrase that has been moved to the very beginning 
of the sentence from its normal position at the end of the sen- 
tence. A zero trace (or copy) marks the location where the con- 
stituent originally occurred in the sentence (indicated by sub- 
scripted type; see 82.6 and §3.8). As indicated in the previous 
section, the qatal/perfect and yiqtol/imperfect verbal forms in BH 
and Qumran Hebrew allow an independent subject pronoun as a 
subject topic before these verbal forms, but not after them 
(Naudé 2001). 

Naudé (1996a, 181) demonstrates that when the verb is 


one of the so-called consecutive verb forms in BH, this topic 


> Five of these six edge-constructions in BH were treated together for 
the first time from a generative perspective in Holmstedt (2014). 
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position is not preverbal but postverbal. Independent pronouns 

that occur with consecutive verbs in BH sentences are still subject 

topics, even though they occur postverbally: 

(3) NIT OPA ARR NRA AYT nya 20 NRA TON Tian NRA Nip? 
BBD WA TWS TMDL TZ TN 2 ON 

‘Take from the exiled community, from Heldai, Tobijah, 

and Jedaiah, who have come from Babylon—and (you) will 

go, you, proceed on that day—you will go to the house of 

Josiah son of Zephaniah.’ (Zech. 6.10) 

Naudé (1996a) argues that the sentence-initial position of the 
consecutive verb forms is the result of obligatory verb movement 
(‘verb raising’ in generative grammar). This parameter occurs in 
BH, but not in later forms of Hebrew (e.g., Qumran Hebrew and 
Mishnaic Hebrew) or Biblical Aramaic. As a result, consecutive 
verb forms in Qumran Hebrew, for example, are not attested with 
the independent subject pronoun following the verb. Generative 
grammar thus provides a principled means to distinguish the 
appearance and absence of independent subject pronouns with 
finite, pro-drop verbs. It also assists in identifying the locus of 
both language difference and language change (see further 83.9 
below). 

The second construction, left dislocation (identified in 
Naudé 1990), involves a constituent that occurs outside of the 
sentence (indicated by double underlining) and has a resumptive 
element that occurs within the sentence (indicated by under- 
lining and bold type): 

(4) Ania of ty Ty Ad mace Digwena Don 
‘To Michal daughter of Saul, there was to her no child (i.e., she 
had no child) until the day of her death.’ (2 Sam. 6.23) 
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In this example, the resumptive element occurs in situ, that is, in 
the normal position of the constituent in the sentence. It is also 
possible for the resumptive element itself to be in a preverbal 
topic position so that it occurs at the beginning of the sentence 
proper: 
(5S) 773238 DAD MAR Wy NinpaxT oy) 
‘..and among the slave girls that you speak of, among 
them I will be honoured among them: (2 Sam. 6.22) 


In other words, the example exhibits both left dislocation of a 
constituent and its resumption in the matrix sentence, and also 
topicalisation of the resumptive element within the matrix sen- 
tence. Because topicalisation and left dislocation may occur in 
the same sentence, they must be distinct constructions. Analyses 
that indiscriminately merge the two constructions into ‘fronting’ 
(or casus pendens) obscure the distinctive features and thus the 
meanings of the constructions. 

A third construction is like topicalisation in having no 
resumptive element within the sentence, although a constituent 
apparently occurs outside of the left edge of the sentence. Naudé 
(1990, 124; see also 1999), who first identified this construction, 
referred to it as a variety of topicalisation. Holmstedt (2014) 
refers to it as ‘heavy topic fronting’. We use the term ‘heavy 
topicalisation’ (Miller-Naudé and Naudé 2019; 2021): 

(6) Ara 7aTa pan np? TAT TN 
‘,..but (as for) my lord the king, why does he delight jy tora 
the king IN this thing?’ (2 Sam. 24.3) 
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In this example, the initial boundary of the matrix sentence is 
indicated by the interrogative particle, but the constituent out- 
side of the left edge of the sentence is not resumed within the 
sentence proper. 

The fourth construction is extraposition. In the same way 
that topicalisation moves a constituent out of its canonical posi- 
tion to the preverbal topic position at the beginning of a sentence, 
extraposition involves a constituent that is moved to the end of 
the sentence outside of its canonical position: 

(7) THAN Way DARI PRI yp 
“|++the men who battle against you Will be as nothing and non-existent 
the men who battle against you.’ (Isa. 41.12) 


Like topicalisation, extraposition does not involve resumption 
within the matrix sentence.* 

The fifth construction is right dislocation. In many ways 

this construction is the mirror image of left dislocation in that a 

constituent occurs outside of the end-periphery of the sentence 
and a co-referential element occurs within the sentence proper: 

(8) qwE-NID Yea T2D OvAy 7201 ANTS T2791 TAY 72791 DTO"TZA RN 

[DTN OY 72 WITTE MY OWA pRVA AN? OMS Iw 

“HY DD7D AVIS TOPS 779 THIN) WI 77! VWI Ova 779 

nvana 

‘The king of Sodom and the king of Gomorrah and the king 

of Admah and the king of Zeboiim and the king of Bela, 

which is Zoar, went forth and engaged them in battle in 

the Valley of Siddim—Chedorlaomer_king of Elam and 


“For an approach to the differentiation of extraposition from extraposed 
apposition, see Holmstedt and Jones (2017, 42-47). 
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Tidal_king of Goiim_and_Amraphel king of Shinar _and 


Arioch king of Ellasar—four kings against five.’ (Gen. 14.8- 
9) 
The sixth construction is heavy extraposition, which is the 


counterpart of heavy topicalisation in having a constituent that 
is moved beyond the end-periphery of the sentence without 
resumption inside the sentence. It was identified for the first time 
in Miller-Naudé and Naudé (2019): 
(9) nan 87 DYN NI} AW? OTN DDT 
‘And in a few days he will be broken jot by anger and not in battle ANd 
not by anger and not in battle.’ (Dan. 11.20) 


If the waw is understood as indicating a sentence boundary, then 
the negated prepositional phrases occur outside of the sentence 
boundary. Another possible example of heavy extraposition occurs 
in the following example, where a waw introduces a prepositional 
phrase: 

(10) AAT OR] Tw ONIN DIDNT OPIN WRI TZN 

‘and I made the stench of your armies rise ;, your nostrils ANd 

in your nostrils. But you did not turn back to me—declares 

the LORD.’ (Amos 4.10) 

Furthermore, generative linguistics allows for the differen- 
tiation of various kinds of left dislocation on the basis of the 
nature of agreement features between the dislocated element and 
the resumptive element. In addition to the heavy topicalisation 
construction discussed above, Miller-Naudé and Naudé (2021) 
identify five kinds of left dislocations, namely: clitic left disloca- 
tion, where a dislocated constituent exhibits case agreement with 
its resumptive (11); hanging topic, where the dislocated constitu- 


ent is always a noun phrase, but the resumptive within the matrix 
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sentence may bear any grammatical relation to the predication 


(12); 


left dislocation with deictic resumptive which is corefer- 


ential with the dislocated constituent (13); left dislocation with 


independent pronoun resumptive (14); and left dislocation with 


a noun phrase resumptive (15). 


(11) 


(12) 


(13) 


(14) 


(15) 


DONNY) DX POAVIF-NN) TNArNy 

‘Your wickedness and your abominations, you bear them’ 
(Ezek. 16.58) 

DON72 TAS PAN Pax 

‘North and south, you created them’ (Ps. 89.13) 

17, We 17732? Is DRT NN? NYRI *NI37 


‘,..and for my daughters, what should I do for these today 


or for their sons which they have borne?’ (Gen. 31.43) 
029 ond? gin 0299 TNT ODADN IT 
‘The LORD your God who goes before you, he will fight for 
you.’ (Deut. 1.30) 
poy ingnyy) mn? Wy DA?wA Naya Wa DONATWE waIM 
WANA NIT waa ANN 


‘But the person who eats flesh from the sacrifices of 


wellbeing which belong to the LORD and his uncleanness 


is upon him, that person shall be cut off from his relatives.’ 
(Lev. 7.20) 


A further insight was the discovery of the ‘frame of reference’ 


construction, which is neither left dislocation nor topicalisation, 


but rather a detached noun phrase, which provides the frame of 


reference for the discourse that follows (Miller-Naudé and Naudé 
2021): 
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(16) oPipcty qPapa NIT TD ANN) ay 727 Wy TIT 
‘And the word which we spoke, I and _you—behold the 


LORD is between me and between you forever.’ (1 Sam. 
20.23) 


3.4. Pronominal Syntax and the Meaning-Making of 


Pronominal Reference 


The so-called tripartite verbless clause in BH consists of two nom- 
inal phrases and a 3ms pronominal element. In the differentiation 
of the pronominal element, it is either a resumptive element of a 
left dislocation construction or it is a ‘last resort’ syntactic strat- 
egy in which the pronominal element is a pronominal clitic, 
providing agreement features for the subject in order to prevent 
ambiguity in the assignment of subject and predicate (Naudé 
1990; 1993b; 1994c; 1999; 2002a; 2002b). Disjunctive and con- 
junctive accents provide important evidence for prosodic phras- 
ing, which can be utilised for differentiating the role of the 
pronoun in these two types of sentences, which are otherwise 
structurally identical (Naudé and Miller-Naudé 2017). Cross- 
linguistically, left dislocation involves a ‘gap’ at the boundary 
between the dislocated constituent and the matrix sentence. In 
spoken language, this gap may be realised by a small pause or an 
interjection (Berman and Grosu 1976); in the MT it is marked by 
a disjunctive accent. In this case the pronominal element is a 
resumptive of a left dislocation construction: 
(17) TRF NI TIT 

‘And (as for) David, he was the youngest.’ (1 Sam. 17.14) 

Where it is marked by a conjunctive accent, there is no gap 


and the pronominal element is utilised as a ‘last resort’ syntactic 
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strategy. The pronominal element is a pronominal clitic, which is 
used to prevent ambiguity in the assignment of subject and predi- 
cate (see Doron 1986): 

(18) TIA NIA PTY 

‘A righteous one (he) is the LORD.’ (Lam. 1.18) 

The pronominal clitic is used to indicate that the adjectival con- 
stituent should be understood as the subject, rather than the 
predicate. The conjunctive accent on the first constituent and the 
pronominal clitic that follows it indicates that the sentence 
should be interpreted as ‘A righteous one is the LORD’, not as 
‘The LORD is righteous’ (with the adjective as the predicate). 

Generative linguistics has also been used to provide new 
and insightful analysis of several syntactic constructions involv- 
ing pronouns. First, the so-called ethical dative with the preposi- 
tion lamed is not a sentence constituent but a reflexive anaphor 
that is translated in English as x-self (e.g., Gen. 22.2 -x F771 
man pix ‘and take yourself to the land of Moriah’; Naudé 1995, 
1997). 

Second, all of the dative constructions in BH can be differ- 
entiated with attention to their syntactic features as identified by 
generative grammar (Naudé 2013); these are summarised in 
Table 1. 
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Table 1: Summary of Dative Constructions in Biblical Hebrew 


; : Indirect : ee 
Ethical | Possessive ; Commodi | Experiential 
object 
Subcategorised - - + = = 
Compulsor 
ae, + = - 4. 4p 

clitic 
Reflexive 

+ = = = = 
anaphor 
Bound to exter- 

+ - - + - 
nal argument 
Bound to inter- 

- + + - + 
nal argument 


Third, coordinate subjects consisting of an independent 
personal pronoun and a noun phrase should be understood as an 
adjunct to the null subject of the finite verb rather than as its 
subject (Naudé 1999; cf. Holmstedt 2009; Scheumann 2020). In 
light of the characteristics of null subject languages generally, it 
is preferable to identify so-called coordinate subjects with 
independent personal pronouns in BH as adjuncts rather than 
sentence subjects (e.g., Judg. 11.37 naax) oaAAm oy ATI n2>NI 
ony "23x “ina-dy ‘and I will go up and down on the mountains 
and mourn for my virginity, I and my companions’). The structural 
position of an independent pronoun as a coordinate subject 
coincides with that of a dislocated constituent in a left/right 
dislocation, namely, a constituent that is base generated in an 
adjunct position (Naudé 1999, 75-99). 

Finally, generative syntax provides a means to distinguish 
between pronouns and anaphora (i.e., linguistic units deriving 
their interpretation from a previously expressed unit of meaning, 
viz., an antecedent, such as reflexive pronouns and reciprocal 
pronouns). Because pronouns and anaphora in BH have identical 


morphological forms (contrast English pronouns I, me, my as 
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distinct from the reflexive anaphor myself and the reciprocal 
anaphor ourselves), one can only distinguish pronouns and ana- 
phora in BH on the basis of the theoretical syntactic principles of 
generative grammar; contrast the translation of the pronominal 
suffix on i) with the pronoun (‘him’) in Neh. 13.5 as opposed to 
the anaphor (‘himself’) in 1 Chron. 15.1. The syntax of the two 
sentences appears to be identical unless one appropriates the 
insights afforded by generative linguistic theory (see Miller- 
Naudé and Naudé 2019). Identifying whether a Hebrew form is 
an anaphor as opposed to a pronoun is critical for translating and 


interpreting these biblical sentences. 


3.5. Lexical Categories and the Meaning-Making of 
Lexical Morphology 


Lexical categorisation or word classes (traditionally called ‘parts 
of speech’) in generative grammar is part of Universal Grammar.° 
As indicated above in §2.7, each lexical item in the mental 
lexicon is identified as a member of a particular category. For 
lexical categories (as opposed to functional/grammatical cate- 
gories—see §3.6), generative grammar employs the traditional 
philological terms N(oun), V(erb), A (adjective/adverb), and 
P(reposition). 

A generative approach to lexical categories provides a 


heuristic method for lemmatisation within lexica as well as a 


° See Miller-Naudé and Naudé (2017a, 276-88) for a description of 
categorisation in generative grammar and a comparison of 
categorisation in Functional Grammar, Cognitive Linguistics, and 
linguistic typology. 
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framework for differentiating morphosyntactic characteristics of 
lexical items as a basis for determining their semantic and prag- 
matic value(s). As an example of a generative approach to lexical 
categories, an analysis of 110 demonstrates that it must be cate- 
gorised as an adjective and not a noun on the basis of morpho- 
logical as well as internal and external syntactic features (Miller- 
Naudé and Naudé 2017a, 288-303; see also §3.8). 


3.6. Quantification and the Meaning-Making of 


Quantifiers 


Quantifiers are a functional category in generative linguistics. 
Quantifiers specify the amount or quantity of the referents of a 
noun. In this section, two kinds of quantification are considered: 


negation and the quantifier 55. 


3.6.1. Negation 


Generative linguistics has contributed to a nuanced understand- 
ing of the semantics of negation in BH through the concept of 
negative scope, which allows interpreters to distinguish between 
a negative marker that has scope over the entire predication 
(sentential negation) as opposed to scope over only a constituent 
(constituent negation; see Snyman and Naudé 2003; Snyman 
2004; Naudé and Rendsburg 2013; Miller-Naudé and Naudé 
2015; 2017b). 

Sentential negation requires the negative marker to 
immediately precede the verb (e.g., Jer. 23.21 -nx *anQw-Nd 
o’xain ‘I did not send the prophets’). By contrast, constituent 


negation immediately precedes a non-verbal constituent (e.g., 
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Gen. 45.8 D°O0>NF 72 AIA Nn’ OANW OAN-ND AMY ‘And now you 
did not send me here, but rather God’). For the negative cycle in 
BH, see §3.9. 


3.6.2. Quantifier 55 


Generative linguistics allows interpreters to distinguish four 
syntactic constructions involving the quantifier 52, differentiated 
by specific features of the noun phrase (NP) modified by 5> (see 
Table 2).° The most important features are definiteness and 
number, but countability (count nouns, mass nouns, collectives), 
nominal status, and specificity are also relevant. Each construc- 
tion conveys various semantic nuances: singular definite (e.g., -2 
xia off ‘all that day’ Exod. 10.13), plural definite (e.g., 7-52 
qn ‘all the days [totality of the specific group] of your life’ Gen. 
3.14), singular indefinite (e.g., ni-52 ‘every day’ Ps. 140.3), plural 
indefinite (e.g., Twa oADRATM’a Nig yD ‘all [each and every 
one of] the days the house of God was in Shiloh’ Judg. 18.31). 
When the noun refers to a unique entity or a collective, the 
singular definite quantified phrase means ‘the whole’ (e.g., ~52 
yixa ‘the whole earth’ Gen. 1.29; Dnpn-5a ‘the whole assembly’ 
Ezra 2.64). 


© See Naudé (2011a; 2011b) for BH and Naudé and Miller-Naudé (2015; 
2022) and Miller-Naudé and Naudé (2020b) for Qumran Hebrew. 
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Table 2: Syntax and Semantics of 5 
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Indefinite 
Definite distributive, 
collective, inclusive, specific | implicitly inclu- 
sive, non-specific 
Totality of the 
individual 
Totality of the 
. members of the P Each, every 
Meaning : ; entity—the 
single entity— ers 
each individual 
ae entity 
< | Individ- | [+ individ- [ + individ- 
a : : = : 
3 uation uation] uation] 
bo 
& count noun 
Nn : 
referring to 
Count- count noun, ; : 
a unique entity count noun 
ability mass noun ; 
/ collective 
noun 
Nominal nominal or 7 : 
; nominal nominal 
status pronominal 
Each and eve 
. Totality of the ef 
Meaning fi one of the mem- 
specific gro 
. pao bers of the group 
5 | Individ- [-individ- eran 
= ; . [-individuation] 
g uation uation] — 
3 
a Count- count noun, 
ve count noun 
ability mass noun 
Nominal nominal or . 
: nominal 
status pronominal 


Grey highlighting indicates a distinctive class of singular definite nouns 
that are either count nouns referring to a unique entity or mass nouns. 
The semantics of 52 with this class of nouns differs from other singular 


definite nouns. 
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Because BH has additional constructions with >> that are not 
attested in English or in Modern Hebrew, determining the precise 
nuance of each construction requires careful attention to the con- 
stellation of features—morphological, syntactic, and semantic— 
that differentiate them. 


3.7. Relative Clauses and the Meaning-Making of 


Relative Nominal Modification 


The fact that relative clauses in BH may be either restrictive 
(identifying the precise referent of the head noun) or non- 
restrictive (qualifying but not identifying the head noun) in their 
semantics went unnoticed by philologically-orientated Hebrew 
grammars, but it is an important linguistic distinction with exe- 
getical ramifications. 

Holmstedt (2016, 194-215) provides the linguistic analysis 
of the two types of relative clauses and the linguistic indicators 
for differentiating them.’ For example, in 2 Chron. 15.11 (imam 
wan Oowiarya xia Dra 1179), the relative clause is syntactically 
marked as restrictive because of the zero marked relative with a 
definite head (by contrast, zero marked relative clauses with an 
indefinite head are non-restrictive). The translation must be ‘and 
they sacrificed to the LORD on that day from the booty that they 
had brought’ (restrictive), rather than ‘and they sacrificed to the 
LORD on that day from the booty, which they had brought’ (non- 
restrictive; see Holmstedt 2016, 210). 


” For the restrictions on the resumptive element in Biblical Aramaic 
clauses, see Naudé (1991b; 1996). 
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3.8. Empty Categories, Null Constituents, and the 


Meaning-Making of Invisible Structure 


Generative grammar employs a number of null (or, zero) items. 
Null items are present in the underlying structure of language but 
not in the surface structure. Nonetheless, null items affect both 
the surface realisation of language and its semantic inter- 
pretation. A few of these null items are mentioned briefly in this 
section. 

In §3.2, null subjects of finite verbs were discussed. These 
subjects are not expressed in the surface structure, but they can 
nonetheless serve as an antecedent (see also §3.4). 

In §3.3, the edge constructions that involve movement of a 
constituent to the left (beginning) of the sentence (viz. topicali- 
sation) or to the right (end) of the sentence (viz. extraposition) 
contain a null constituent. Moved constructions leave a trace in 
their original position, which may affect the structure of the 
sentence. 

Relative clauses may optionally have two types of null 
constituents. Null heads are used frequently in BH (Holmstedt 
2016, 113-28) instead of a NP as head (contrast Gen. 27.45 
with a null head and Gen. 24.66 with an overt NP as head; see 
Holmstedt 2016, 114). Interpretation of the semantics of the null 
head depends upon both “the position of the gap (or trace) or 
resumptive within the relative clause and the discourse context” 
(Holmstedt 2016, 115). Zero relatives also occur in BH, alongside 
a range of overt relative markers (Holmstedt 2016, 81-83). As 
indicated in 83.7, zero marked relative clauses have a restrictive 


semantic interpretation when the head of the relative is definite, 
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whereas overtly marked relative clauses may have a restrictive 
or non-restrictive semantic interpretation. 

Recognising that adjectives in BH may modify a null noun 
provides a principled way to handle the identification of the 
‘adjective’ lexical category. For example, in Gen. 29.16 (mw 72951 
on qopa ow) AX? 19730 ow nia), the determined adjectives 79737 
‘the big’ and niopn ‘the small’ must be understood as morpho- 
logically agreeing with and modifying the null noun na, so that 
the adjectives refer to ‘the elder [daughter]’ and ‘the younger 
[daughter]’ (see Miller-Naudé and Naudé 2016). Recognition 
that an adjective may modify a null noun provides a principled 
linguistic argument against so-called ‘substantival adjectives’ in 
BH. 

Another advance has been made in recognising that so- 
called ‘verbless sentences’ should be understood as sentences in 
which the copular verb 7°n is a null constituent (see, e.g. Naudé 
1993a; DeCaen 1999; Sinclair 1999; Cowper and DeCaen 2017; 
Wilson 2020). Recognition that all predications in Biblical 
Hebrew have the same underlying structure provides a simplified 
and unified analysis. 

Finally, ellipsis comprises a number of distinct syntactic 
constructions in which one or more required constituents are 
present in the underlying representation but are absent in the 
surface structure (see Miller-Naudé 2005; 2007a; 2007b; 2007c; 
2008; 2011; 2013; Holmstedt 2021). Ellipsis therefore requires 
the hearer/reader to be able to reconstruct appropriate seman- 
tic interpretations of sentences with apparently incomplete 


structures. 
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3.9. Historical Linguistics and Meaning-Making 


through Language Change and Variation 


Generative linguistics provides a way to identify language change 
and variation through the change of feature parameters. In exam- 
ining the change of feature parameter settings between BH and 
Qumran Hebrew, Naudé (2000; see also 1996a) identifies the 
following types of changes: change in syntactic category descrip- 
tion (viz. grammaticalisation); change in subcategorisation frame 
of some lexical verbs (e.g., the use of the preposition a to mark 
the accusative; contrast Jer. 7.26 with 4Q506 131-132.IV.13) 
and lexical nouns (contrast Isa. 48.17 and 1QIsa* XL:23); and 
changes in lexical features of both nouns and verbs. 

Another important development in historical linguistics 
relates to cyclic change, the observation that change often occurs 
in stages, with later stages of the language resembling earlier 
stages (Van Gelderen 2011). One important cycle that has been 
identified in many languages involves the negative existential 
(see Veselinova 2016), which is also operative in pre-modern 
Hebrew (see Naudé and Miller-Naudé 2016; Naudé, Miller- 
Naudé, and Wilson 2019; 2022). One of the most important 
insights for Biblical Hebrew is the fact that the stages of a cycle 
are usually overlapping, resulting in synchronic variation along- 


side diachronic change. 


4.0. Prospects for Further Study 


The survey of the study of BH in Van der Merwe, Naudé, and 
Kroeze (2017, 6-12) reveals that comparatively little work has 


been conducted from the theoretical standpoint of generative 
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linguistics, in spite of its rich and insightful contributions to the 
study of language in general and to the study of languages in 
particular. As a point of comparison, Radford’s (1997a; 1997b; 
2004; 2009) analyses of the structure of English from a Mini- 
malist perspective demonstrate how generative linguistics can be 
applied to the study of a specific language to produce comprehen- 
sive grammatical analyses. There is, unfortunately, nothing even 
remotely comparable for BH from a generative perspective. The 
research described in the previous sections only begins to scratch 
the surface of some of the numerous important, unsolved prob- 
lems in the structure of BH, which have direct implications for 
the meaning-making enterprise of textual interpretation. 

To conclude, we return to our initial questions. Is linguis- 
tics necessary for the interpretation of the languages of the bib- 
lical texts? Is it not better—and easier—just to read the texts 
without theory? In our view, the reading of the biblical texts in 
the light of linguistic analysis can be compared to the work of 
meteorologists. Meteorologists study weather patterns scientifi- 
cally, employing relevant theories for the observation, descrip- 
tion, and prediction of weather patterns. The results of meteoro- 
logical inquiry feed into weather reports, which are delivered in 
simple, accessible terms for ordinary people to understand. 
Although sustained observation of the weather by laypersons 
(e.g., in farming almanacs) may sometimes prove to produce 
accurate forecasts, it is not theoretically informed and does not 
rest upon scientifically proven methodology. It is therefore vastly 


inferior to the analysis of weather produced by meteorologists. 
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If linguistics is necessary for insightful understanding of the 
biblical text, then which linguistic theory should be used? We 
need to break with the tradition of viewing linguistics through 
the reductive lenses of schools, theories, or interdisciplinary 
exchanges. Instead of the fragmentation of knowledge, the focus 
must be on the progress of knowledge by its growth or accumu- 
lation (see D’hulst and Gambier 2018). Furthermore, the search 
for knowledge must be a purposeful, meaning-making activity 
that is functionalist in orientation (Nord 2018). In other words, 
the search for knowledge within a discipline must be nuanced 
and it must be typified by the purpose for which it is intended. 

In applying the discipline of linguistics to the study of the 
biblical text, instead of competition between linguistic theories, 
the move should be to a search for a complex viewpoint, seeing 
the study of BH as a complex whole (Miller-Naudé and Naudé 
2020a; Naudé and Miller-Naudé 2020). However, within a com- 
plex approach, the significant and insightful generative linguistic 
contributions to understanding BH language structure cannot be 


minimised or ignored. 
5.0. Further Reading 


5.1. Handbooks, Companions, Glossaries 


Den Dikken (2013) 

England (1978) 

Everaert and Van Riemsdijk (2006) 
Sells (1985) 

Webelhuth (1995) 
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5.2. General Introductions 


Bach (1964) 

Carnie (2007) 

Haegeman (1994) 

Hornstein, Nunes, and Grohmann (2005) 
Radford (2004; 2009) 

Van Gelderen (2017) 

Van Riemsdijk and Williams (1986) 


Ny ee Be 


5.3. Foundational Texts 


1. Borer (1984) 

2. Chomsky (1957; 1965; 1969; 1975b; 1980; 1981b; 
1986b; 1992; 1995; 1998; 2013) 

3. Jackendoff (1972) 

4. Katz and Postal (1964) 
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FUNCTIONAL GRAMMAR AND THE 
PRAGMATICS OF INFORMATION 
STRUCTURE FOR BIBLICAL LANGUAGES 


Randall Buth 


The pragmatics of information structure, and Functional Gram- 
mar in particular, encourage and provide a framework for 
students and scholars of the biblical languages to read with 


increased nuance and precision. 


1.0. Historical Development 


Functional grammatical approaches include semantic and prag- 
matic information in a grammatical description. Pragmatics 
refers to the effects that are made within communication and will 
be discussed below. This inclusion of pragmatic information 
within a formal grammatical description can be contrasted to 
transformational-generative approaches that adopt a policy of 
autonomous syntax. The Functional Grammar (FG) developed by 
Simon Dik in particular reflects the confluence of two develop- 
ments in linguistics: (1) the pragmatics of information struc- 


ture as extrapolated, for example, from Functional Sentence 
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Perspective (FSP)! and (2) generative-transformational grammars 
that stem from Noam Chomsky, especially 1957-1980. The pri- 
mary pragmatic functions that were incorporated in the early 
sentence grammar of FG were Topic and Focus, discussed below. 

Generative and transformational grammar were a major 
impetus for FG. A generative grammar wants to know the rules 
that will produce all acceptable, grammatical sentences of a 
language and restrict the production of unacceptable, ungram- 
matical sentences. The primary difference between a strictly 
generative approach and functional approaches can be summa- 
rised in the notion ‘autonomous syntax’, which is the restricted 
domain of formal grammar without including semantic or prag- 
matic information in the rules. Functional approaches to gram- 
mar, on the other hand, reject such a restriction and recognise 
that pragmatic functions like Topic and Focus exert direct influ- 
ence on the rules. In addition to including pragmatics in formal 
grammatical description, FG eschews the idea of transformations 
and filtering devices in formal grammar as psychologically prob- 
lematic, as discussed below. 

Dik published the basis of this approach in two volumes 


(1978; 1980). From the beginning, he embraced a principle of 


' For examples, see Dane’ (1974) and Firbas (1992). The roots of FSP 
precede World War II. FSP broadly analyses communication as 
comprising known or assumed information as a base that is called the 
Theme. New or salient information is then added to the Theme as the 
intended communication. This salient meaning is termed the Rheme. In 
English, these two perspectives are often called Topic and Comment, 
respectively. 
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rigorous formalism in his theory. The various systems of syntax 
(like predicates, subjects, and objects), semantic roles (like agent, 
experiencer, and patient), and pragmatics (like Topic and Focus) 
are assumed to influence the final output of a sentence and 
should be included within formal grammar. However, as these 
ideas were applied in studies on various languages, the complexi- 
ties inherent in these notions have led to increasingly complex 
notation. Dik (1989) expanded many kinds of Topics and Foci. 
Some ideas from discourse analysis and greater attention to prag- 
matics led to positing multiple layers of underlying represen- 
tation before the surface level sentences would be generated or 
interpreted within a grammar. 

A former student and colleague, Kees Hengeveld, edited 
and published Dik’s work posthumously in 1997. Hengeveld and 
J. Lachlan McKenzie then went on to include a formal discourse 
framework within that theory, ideas that developed out of a con- 
cern to include syntax, semantics, and pragmatics within formal 
grammatical theory. Hengeveld and McKenzie forged these into 
a comprehensive theory with four abstract layers. They published 
their collaborative effort in 2008, Functional Discourse Grammar 
(FDG; for a brief description of FDG, see Hengeveld and 
McKenzie 2005, 668-76). Since then, FDG has been considered 
the successor of FG and current studies tend to use the 2008 
volume as a starting point. However, the complex metalanguage 
of later developments presents a barrier for those primarily inter- 
ested in biblical languages, while the initial pragmatic functions 
in FG are more accessible. Concepts from information structure 


are widely used within other functional grammars (see, e.g., 
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Givon 1984, who downplays rigorous formalism, and then Givén 
2001, where he reincorporates some formalism; see also Foley 
and Van Valin 1984). The pragmatic functions of information 


structure within FG will therefore be the primary focus here. 


2.0. Key Theoretical Commitments and Major 


Concepts 


People communicate with language, prototypically with sound, 
but also in written media and in sign languages. A signal is 
intended to communicate meaning. In ‘The girl walked to the 
market’, the speaker wants the hearer to think about a particular 
girl, a particular market, and communicates that the girl walked 
there. Grammatical theories attempt to explain how that sentence 
is produced. 

With this in mind, FG aims at several goals.” These include, 
first, typological adequacy, meaning the theory should be formu- 
lated in terms of rules and principles that apply to any natural 
language. Second, pragmatic adequacy, meaning the theory 
should explain how linguistic expressions may be used in com- 
munication. Third, psychological adequacy, meaning that what 
the theory says about a language should be compatible with what 
is known about the psychological mechanisms involved in natu- 
ral language processing. 

To these ends, FG has identified certain functions that 


affect the output of a predication into sentences. These include:? 


? See Dik (1980, 2). 
3 See Dik (1980, 3). 
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SEMANTIC FUNCTIONS (Agent, Patient, Recipient, etc.), 
which define the roles that participants play in states of 
affairs, as designated by predications. 

SYNTACTIC FUNCTIONS (Subject and Object), which define 
different perspectives through which states of affairs are 
presented in linguistic expressions. 

PRAGMATIC FUNCTIONS (Theme and Tail, Topic and Focus), 
which define the informational status of constituents of 
linguistic expressions. These relate to the embedding of 
the expression in ongoing discourse and are determined 
by the status of the pragmatic information of Speaker and 


Addressee as it develops in verbal interaction. 


Using these functions, FG posits that an underlying abstract 


predication frame could produce the following sentences: 

(1) The girl walked to the market in the morning. 

(2) In the morning the girl walked to the market. 

(3) As for the girl, she walked to the market in the morning. 


The predication frame comes from the abstract lexicon of a lan- 
guage and would join three arguments (sentence constituents) to 
that predication, ‘to walk’: the girl, to the market, and in the 
morning. These arguments receive different taggings for syntax, 
semantic roles, and optionally pragmatics. In (1), the girl subject, 
Agent] iS a Syntactic Subject and semantic Agent, while to the market 
[Goal] 1S a Semantic Goal and in the morning jrime; is a semantic Time. 
The semantic taggings are the same for (2), however in the 
MOTNINE rime, Topic) is both semantic Time and pragmatic Topic. In 


(3), as for the girl treme is a pragmatic Theme, a framework for 
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the sentence similar to Topic but outside the sentence syntax. She 
[subject, Agent] 1S a Syntactic Subject and semantic Agent, while to the 
market {goa is a Semantic Goal. Notice the difference between (1) 
and (2). The predication frame and the semantics are the same, 
but the pragmatics differ. In (1) the Time in the morning is part of 
the new, salient, rhemic/comment information, but in (2) in the 
morning has been placed at the beginning according to pragmatic 
placement rules. There the argument is tagged a Topic to provide 
a pragmatic setting or situational orientation for the communica- 
tion that follows.‘ 

FG assumes that predications use the functions explained above 
to generate sentences, a process which occurs with abstract 
templates specific to a particular language. The template speci- 
fies where certain functions may appear, and positions in the 
templates can be filled by individual words, phrases, and clauses. 
So discussions about word order are actually about constituents 
or ‘pieces’ of a sentence that are positioned to fulfil one of the 
syntactic, semantic, or pragmatic functions. A sentence is able to 


organise and output the functions as follows:° 
(4) Theme; Pragmatic positions, Syntactic positions; Tail 


Within this template, syntactic positions might involve numer- 
ous variations, such as S-O-X; S—V-—O-X; V-S-O-X, etc. But 


* A Topic is pragmatically marked material that orientates a sentence to 
its context. It does not have Focus intonation and is not necessarily the 
broader subject matter of the sentence. 


° The templates in (4) are also simplified and without the algebraic 
notation of much of FG and FDG. 
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pragmatic positions appear before syntactic positions, which is 
fundamentally important for FG and for human communication 
in general.© Such a pragmatic position allows FG to explain a 
phenomenon in V-S-—O languages, namely the possibility of S-V- 
O as an/the alternative order (see Dik 1980, 155). 

While syntactic, semantic, and pragmatic functions are all 
necessary for FG as a theory, understanding pragmatic functions 
may help readers of biblical languages most. Pragmatic functions 
are part of all natural language and must be internalised to com- 
municate in any given language. Two of the pragmatic functions 
noted in (4) operate outside sentence syntax and are separated 
by semicolons in the templates. First, the Tail is an afterthought, 
prototypically at the end of a sentence, whether as a side com- 
ment or reinforcement. Also, a phrase like an appositive or paren- 
thetical comment can be treated as a Tail that may be placed 
inside a syntactical sentence. Second, the Theme is any intro- 
ductory, setting, or topical material that is prototypically at the 
beginning of a sentence, traditionally treated as a casus pendans 
and called left dislocation in linguistics. 

The FSP distinction between Topic and Comment has led 
to a foundational distinction of marked information structure in 


FG: Topic and Focus. A Topic constituent is specially marked 


° As part of typological adequacy, FG theorises that a potential 
pragmatic position exists at the beginning of a main clause of whatever 
the basic order may be. FG does not claim that devices like intonation, 
morphology, or special vocabulary do not also play roles in informa- 
tional pragmatics. But placement rules and positions before the main 
syntactic components are stipulated as universal. 
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information that is presupposed or already established. It is a 
starting point for the rest of the sentence and potentially linked 
to the larger context. A sentence constituent that functions as 
Topic is eligible for placement in a pragmatic position before the 
main, default sentence order. But a Topic is not necessarily the 
syntactic Subject, which often creates confusion. In English, the 
word ‘topic’ is a synonym of ‘subject’. But in FG a marked Topic 
is specifically a constituent that provides a framework and link 
to the larger context in which a sentence occurs. It may be a 
Subject, but it may also be some other constituent. Given this 
potential confusion, I use the term Topic in this article when 
trying to relate primarily to FG and broader linguistic literature. 
But I use the term Contextualising Constituent (CC) when includ- 
ing the multivalent nature of pragmatic Topics that goes beyond 
marked constituents as subjects and settings. 

A Focus constituent is a specially marked part of the Com- 
ment, which contains the new or most important information of 
a sentence. Focus may be marked by word order, particles, and 
often intonation. It can include many kinds of Comment infor- 
mation, including new, supplemental, reinforced old, contrastive, 
or contra-expected information.’ 

FG thus helps identify pragmatic placements in word order 
and the resulting nuances of understanding. But because FG aims 
at both psychological and pragmatic adequacy, it does not allow 
unmotivated, ad hoc rules to correct mis-predictions of hypothe- 


sised rules. This can be stated in four principles: 


7 Lambrecht (1994) adds ‘sentence focus’ to the Topic and Focus of 
information structure, as discussed below. 
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1. Avoid transformations in the sense of structure-changing 
operations; 

2. Avoid empty elements in underlying structure that are 
unexpressed; 
Disallow filter devices; 
Disallow abstract lexical decomposition (instead, account 


for meaning with definitions). 


So if a constituent is pragmatically placed at the beginning of the 
clause, and the language regularly follows with V-S, then that 
language shows a basic order of V-S. FG does not unnecessarily 
multiply rules. FG would not suggest an abstract S—V order, place 
a pragmatic function in front, and then add a filter to flip the S— 
V order to match the data. An additional rule that would switch 
the placement of S and V after a pragmatic placement would be 
ad hoc. Instead, FG asks simple questions: Would such rules be 
psychologically real? Is there any pragmatic motivation for the 
rule? Are such rules necessary? If the not, then Occam’s Razor 
and the minimal abstractness principle of FG declare such a rule 
to be contra-indicated and ad hoc. 

These principles have been particularly helpful in describ- 
ing word order patterns in Germanic languages, but may also be 
applied to the biblical languages.® For illustration, FG can demon- 
strate a continuum between various V-S languages to the point 
where one may develop into an S—V language. Four categories of 


these languages are described: 


8 Dik (1980, 152-77) devotes a whole chapter to this question. 


76 Buth 


V1 (verb first), where the statistical majority of sentences 


show V-S at the beginning of the clause. 


There do not appear to be any V-S languages that do not also 
allow S-V word order. This fact supports the typological ade- 
quacy of FG, which assumes a pragmatic position before a main 


clause of whatever the basic order may be. 


V2 (verb second), where the majority of sentences show some 


non-verb element first in a pragmatic position. 


In these languages, a pragmatic position rule is activated fairly 


commonly. 


V2s (verb strongly second), where all sentences, allowing for 
some pragmatic exceptions, show the verb in a second 


position. 


A V2s language might look like an S—V language at first glance. 
A pragmatic positioning rule becomes mandatory, thereby dilut- 
ing the strength of the pragmatic marking. The only choice is 
whether the Subject or some other constituent fills the pragmatic 
position. “A V2s language will only have the following constitu- 
ent orderings: a. SVO, b. XVS(O)” (Dik 1980, 158). In terms of 
pragmatics, the S-V orders in a V2s language will function simi- 


larly to the way that Subjects function in real S—V languages. 


V3 (verb third), where the verb comes third whenever a con- 
stituent is pragmatically placed at the beginning, before 


the syntactic S-V. 


In V3 languages, the Subject follows the pragmatic position, so 


that the Verb comes third. Without a pragmatically positioned 
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constituent, the Verb may appear second, after the Subject. In 
other words, a V3 language is a true S-V language, as is English.° 

Not only can FG meaningfully describe the various cate- 
gories of word order, but FG also predicts historical development 
with increasing optional use of pragmatic placement rules 
through time. Eventually, a V2s language can become a V3 lan- 
guage. Dik (1980, 169-75) lists Celtic languages and Welsh as 
examples of V1. Then, within Germanic languages, Old Icelandic, 
Old High German, and Middle Dutch are V2; Icelandic, Norse, 
Danish, and others are V2s; and finally English is V3 (SVO). 

The importance of these basic concepts of FG is best appre- 
ciated when looking at the data of real languages. The next sec- 
tion shows how an FG approach can aid the reading and under- 


standing of biblical texts.1° 


° This does not mean that additional patterns do not exist. For example, 
English has a vestigial question order ‘When did the girl return?’ The 
order ‘did the girl’ is V-S and reflects an archaic word order pattern in 
the history of Germanic languages. 


'° FG also deals with highly presupposed information that is attracted 
to Verbs in what is called Language Independent Preferred Order of 
Constituents (LIPOC), as well as secondary clines of post-verbal 
saliency. These ideas are not discussed in this essay, as they are less 
important for the general reader of biblical languages. 
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3.0. Use and Contributions of FG in Biblical 
Studies 


3.1. Application to Hebrew 


Some scholars of Biblical Hebrew have claimed that the language 
is based on S-V order, while a majority counter that it has a V-S 
order. Even claims that there is no basic order must still explain 
the examples of both S-V and V-S order.'! Recognising constitu- 
ents in communication that have special pragmatic marking can 
be a powerful tool for readers and for an audience. Pragmatic 
signals add information and perspective beyond the basic refer- 


ential semantics of a communication. 


3.1.1. A Test for V-S—O in Hebrew 


Dik (1980, 154) provides a simple test that has explanatory 


power for Biblical Hebrew: 


If, whenever some constituent other than the Subject is 
brought to Pl, the Subject itself appears in preverbal 
position, then the language is a ‘real’ SVO language with 
basic pattern P1SVO. 


™ See Hornkohl (2018), who has provided a review of word order 
studies and found the arguments for a S-V word order unpersuasive. He 
also recognises that this means that all S-V orders would need prag- 
matic explanations, both by those positing a V-S default order and by 
those who might claim that all orders are pragmatically motivated in 
BH. 
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If on the other hand, the Subject appears in postverbal 
position in that condition, then the basic pattern must have 
been P1VSO. 


This is so because FG provides no other means for explain- 
ing the occurrence of postverbal Subjects than the assump- 
tion that the basic position of the Subject is postverbal in 
such cases. In particular, purported rules of Subject Post- 
posing of Subject Verb Inversion simply cannot be formu- 
lated within the framework of FG. Part of the aim of this 
chapter is to demonstrate that this is an advantage rather 
than a weakness. 


According to the criterion formulated above, English is a 

real P1SVO language. On the other hand, Dutch, German, 

and in fact all the other living Germanic languages come 

out as P1VSO languages according to this criterion. 
Generally speaking, in BH, when some item other than a con- 
junction and a Subject is pre-verbal, the remaining order is V-S, 
which is what FG predicts for a V-S—O language, as in (5a): 
(5) (a) niygona ooxa ne 

‘With God Noah walked’ [X;,-.;-V—-S] (Gen. 6.9) 

(b) | JeanT nd OF ORTTAN* [Xpocus-S-V]? 

An S-V model with the same contextual pragmatics would have 


incorrectly predicted (5b).'? The texts in (6) and (7) provide 


In linguistics, an asterisk marks either an incorrect/ungrammatical or 
an unattested form. Here it signals that this is not the attested form in 
the passage under discussion. An X-S-V order would have been a poor 
fit here, but it can occur in a different context with different pragmatics. 
13 Holmstedt (2009; 2011) invokes a special rule here to switch S-V to 


V-S to avoid the mis-prediction, but this is unmotivated and ad hoc from 
a FG perspective. 
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similar examples, in which (6b) and (7b) show what an S-V 
language would have produced: 
(6) (a) 2BATTNS YD ND PWATNY OP] OPT 
‘And on the day of raising the tabernacle the cloud 
covered the tabernacle’ [X-V-S-O] (Num. 9.15) 
(b)  j2WAATNY NOD Tw] WAT NY OD OPI" [X-S-V-O] 
(7) (a) InN Wy D?AT TAVITA 8B TW MY yIN 
‘And in the 14th year Kadorla‘omer came, and the 
kings that were with him’ [V-S] (Gen. 14.5) 
(b) ny Wx DDT] a IDYITIA TY MY papa [S-V] 
These examples illustrate that BH is a V-S language, whose basic 


template can be listed in simple form as 
P-V-S-—O-X 


where P is a pragmatically placed constituent. If this is correct, 
then every case of some non-verb item preceding the verb is a 
pragmatic signal to the audience. That is a strong claim and a 


helpful guideline for readers. 


3.1.2. Topic and Focus in a VSO Framework 


Several examples in Hebrew can illustrate how Topic and Focus 
function in a V-S-O framework before looking at greater 
sentence-level effects. 
Looking first at examples of Focus, consider (8): 
(8) nXtomwy cap pprai-azy-ona 
‘with an innocent heart and clean handsj,,.,,; I did this’ 
(Gen. 20.5) 
In this example, the pre-verbal material is the main point, part of 
the Comment and a marked Focus construction. Something simi- 


lar occurs in (9): 
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(9) cmat ropa my 3773 

‘because from much thought and angerjpous; | was speaking’ 

(1 Sam. 1.16) 
In the context, Hanna is explaining her own behaviour, so the 
words ‘much thought and anger’ are the salient, main part of the 
Comment of the sentence. They are placed in front of the verb as 
a marked Focus. As a final example, answers to questions fre- 
quently use fronting for the most meaningful part of an answer, 
as in (10): 
(10) 7TaV awa TRA ADIN PINA POX TANT NAN PR 

‘“’,.and from where would you be coming?” And they said 

to him, “From a very far country;;..;; have come your 

servants” (Josh. 9.8-9) 

Turning now to consider pragmatic Topics, the text in (11) 
provides a good example: 
(11) ody tas adn ona nad>w-bx Ain? neq yiyasa 

‘In Gibeon;topic) appeared the LORD to Solomon in a night 

dream, and God said...’ (1 Kgs 3.5) 
This is classic syntax of V-S where the MT has a pragmatic Topic 
(CC) that provides a locational setting to the sentence and the 
greater context.'* Something similar occurs in (12): 
(12) (a) nana? ow A nN IP? ONT? RYT 

‘and Israel went out,ys, to the Philistines for war’ 
(b) Twa agtroy nn 
‘and they camped at Even-Ezeryyx,’ 


' The place Gibeon provides a Setting for the scene rather than a subject 
of a sentence, which is why the term Topic can be misleading and why 
I prefer Contextualising Constituent. 
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(c) pax. an onw?A 
‘and the Philistines; 7); camped at Afeq’ (1 Sam. 
4.1) 
In this example, the Philistines are marked as a Topic that pro- 


vides comparison and contrast with Israel. 


3.1.3. Subordinating Clause with Focus 


Most subordinating conjunctions in BH introduce a V-S clause. 
However, pragmatic frontings may occur. The following two 
examples have subordinating conjunctions, one with a fronted 
Subject and one with a fronted Object. Both fronted constituents 
are pragmatically marked as Focus. 
(13) ona33 907 *2 apy? pT 

‘And Jacob did not know,y.s; that Racheljs gocus; Stole them 

[the household gods]’ (Gen. 31.32) 
The Subject comes before the Verb inside this 2-clause. ‘Rachel’ 
can be read as a Focus and as exclusive and contra-expected 
information from Jacob’s perspective. He already knew that a 
theft had occurred. Consider also (14): 
(14) nip np TE TWs "2 

‘as a nation that righteousnessypocus, op; it did’ (Isa. 58.2) 
Again, placing the Object before the Verb can be explained here 
as Focus, because n?7¥ ‘righteousness, justice’ is a contra- 
expected point for irony: ‘as if they were a nation that did what 
would be, in fact, right’. 

In general, subordinating conjunctions provide a link toa 
context themselves, so they do not regularly need a pragmatic 
Topic to link to the context. Thus, if the subordinate clause marks 


a pragmatic position, it is most often Focus, with the most salient 


Functional Grammar 83 


part of a sentence placed before the Verb. This, too, helps the 
audience and readers, who would add a Focal intonation to dif- 


ferentiate Focus from Topic. 


3.1.4. Template for Participles in Hebrew 


The advantage of a rule-based approach to word orders can be 
illustrated with participles as well. Most grammarians recognise 
that the Hebrew participle fits a pattern that is more congruent 
to a Verbless clause than to a clause with a finite verb (for more 
details, see Buth 1999). In FG, the template for Biblical Hebrew 
participles is thus 


(+ /- Topic) (+/- Focus) S—Predicate 


where the ‘+ /-’ notation makes explicit that the use of either a 
Topic or a Focus is optional. In the case of participles, the pred- 
icate would be Participle + Complements. Consider the follow- 
ing example: 
(15) (a) amaT AS ARRAN 
‘Whatyo, rocus] 40 you see, Jeremiah?’ 
(b) ANT oIN TRW OPA 
‘An almond stickjo, gocusy 1 see.” [S—Vparticipte 
(c) ming? naw 
‘You’ve done well to see’ 
(d)  inwy) MaToy ax TpwA 
‘for diligent); am I on my word to do it’ 
(Jer. 1.11-12) 
In (15), a question word ‘what?’ in (15a) is positioned as Focus 
and helps us recognise the salient, new information in the fol- 


lowing answer. Jeremiah’s answer in (15b), ‘an almond stick’, has 
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been placed ahead of the Subject-Participle, in the pragmatic 
position as Focus, as the pragmatically marked new information. 
The main sentence in (15b), ‘I see’, is known, pre-supposed infor- 
mation and follows the Focal Object. The Subject immediately 
ahead of the participle is not pragmatically marked. 

Participial clauses and verbless clauses have a default, 
Subject—Predicate word order, different from finite verbal clauses 
with V-S order. When a word order of Participle-Subject occurs, 
then, as in (15d) and (16) below, there is pragmatic marking of 
the Participle, most often Focus: 

(16) 7338 [7 ITAD WR A7NS ON 

‘and in addition, the nation that they serve (as slaves) ;ropic) 

Iam going to judgejrocus;? (Gen. 15.14) 

In (16), the long Object ‘the nation that they serve’ is positioned 
as a Topic to provide the starting point and contextual link for 
the sentence ‘I am going to judge the nation’. However, contrary 
to the basic Subject—Participle word order, the participle comes 
before the Subject so that the participle is a second, pragmatically 
positioned constituent. The immediate and most natural reading 
is that the ‘judging’ is Focus, contra-expected, and specially 
marked. 

We should also note the pattern of Topic—Focus order in all 
of the above examples that have more than one pragmatically 
positioned constituent. This Topic—Focus order is normal in BH 
and fits with a universal tendency seen in languages around the 
world. Consider the text in (17): 

(17) (a) ATH 


‘A generation goes,’ 
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(b) xatin 
‘a generation comes,’ 
(Cc) nypy oPiv? PIN 
‘and as for the earth;7,,,.; it remains foreverppocus)’ 
(Eccl. 1.4) 
In (17c), ‘the earth’ is not first because it is a grammatical 
Subject, but because it is pragmatically marked. In this verse, the 
rules of pragmatic placement help the reader to interpret the 
functions of the sentence pieces. The adverbial complement o7iy) 
‘forever’ in (17c) has been pragmatically placed before the main 
sentence, most transparently as a Focus, since it is specific, new 
information that also contrasts with the notion of temporality in 
the previous clauses. Note that the word order is not pqxa Dip\* 
nqniy ‘and forever the earth remains’, which has only a pragmati- 
cally marked Focus. In (17c), the Subject is also placed before the 
pragmatically marked Focus, such that it, too, is pragmatically 
marked as a Topic. The earth is marked as a Topic because it is 
being compared to and contrasted with passing generations. Then 
the main point is brought out with a Focus structure, o3iy) 
‘(remain) forever’, which is pragmatically marked salient 
information. 
It is this mix of interpreting pragmatically positioned infor- 
mation with the need to account for all of the results through rule 
identification that makes the recognition of pragmatic functions 


a powerful tool for practised readers of the biblical text. 


3.1.5. Sentence Marking and Discourse Pragmatics 


The above examples have illustrated pragmatic functions of 


Focus and Topic (CC) within a V-S perspective. However, 
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scholars agree that there are many examples of seemingly insipid 
Topics.‘ Many sentences have a fronted constituent that is not 
the most salient point of a sentence and thus is not a candidate 
for Focus intonation as a Focus function. Likewise, the constitu- 
ent does not provide a contextual comparison or set up a new 
Topic chain. Instead, the Topic introduces an isolated sentence 
on its own with interesting Semantics and discourse Pragmatics. 
Hornkohl (2018, 44) has summarised the problem: 

Frequently in BH, elements are fronted for purposes of 

marking something special about the entire clause. Though 

some formulations of information structure include the 

possibility of whole-clause marking by means of a fronted 

constituent—for example, Lambrecht’s sentence focus, 

which may usefully explain certain cases of preverbal posi- 

tioning in BH—this is inappropriate for the vast majority 

of XV instances in BH in which X is neither topic nor focus. 
We need to define and identify the ‘signal’ and then discuss 
proposed meanings or function. A fronted constituent is only 
part of the signal. We can assume that intonation would have 


distinguished a Focus from a Topic.'® So we may call a fronted 


8 Moshavi (2010, 119) lists over 43% of her examples in Genesis as not 
clearly either Topic or Focus. Atkinson (2021, 227-30) lists between 
281 and 451 out of 1,029 examples in Samuel-Kings as being not clearly 
Topic or Focus, which would be 27%-44%. He lists 281 examples as 
‘thetic’ and 170 as unclear. 

6 Khan and van der Merwe (2020, 361) point out that it is reasonable 
to assume that an intonational distinction existed even if we do not have 
access to it. 
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constituent for Focus a different signal from a fronted constituent 
for Topic. 

But these signals are not alone. They come with an entail- 
ment in BH: placing a constituent in front of the verb means that 
a wayyiqtol verb cannot be used. It means that the functions sig- 
nalled by a wayyiqtol verb are blocked. This consideration is vital 
in a small, closed system. Bickerton (1981, 90) points out the 
constraints of meaning in a closed system, and although his com- 
ments are about tense, mood, and aspect, they equally apply to 
any closed system, like wayyiqtol V-X order versus X—-V orders: 

We must note a particular characteristic of TMA [tense- 

mood-aspect] systems which, though seemingly obvious, 

has been ignored by virtually all work up to and including 

Comrie’s (1976) influential study of aspect. ... What each 

marker of modality, tense, or aspect means will be largely 

determined by how many markers of these things there are 

in the system and by what each of the others mean. 

The restrictions of a closed system affect meaning, not just for 
the structure itself but against the other structures with which it 
is in contrast. The functions of fronted constituents in Hebrew are 
morpho-syntactically restricted to being non-wayyiqtol or non- 
weqatal. These X-V sentences become a signal that expands the 
Topic signal—structurally, a fronted constituent without a Focus 
intonation—into a sentence-level structural signal, rather than 
simply being a signal for the fronted constituent. The fronted 
constituent blocks and breaks any continuation of a wayyigqtol 
or weqatal. Although a single constituent, it affects the whole 
sentence and becomes a signal for discourse functions of 


various kinds of discontinuities: temporal non-sequentiality, 
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backgrounding, and discourse discontinuity (see Hornkohl 2018, 
45-51, for examples and discussion). 

Khan and van der Merwe (2020, 349-50) agree that dis- 
course pragmatics are the primary motivation for these kinds of 
non-focus, non-topic structures, stating: “We shall argue that sub- 
jective choices concerning discourse structure and organization 
are the ultimate motivations for using thetic sentences.”’” They 
analyse such X-V sentences as ‘thetic’, a concept discussed fur- 
ther in the next section. But they also define thetic sentences 
notionally and not uniquely as a structure, saying that “thetic 
sentences may be expressed by constructions other than constitu- 


ent fronting and have different contours” (2020, 361). 


3.1.6. Fronting and Thetic Sentences 


The discussion of thetic sentences is a contribution to Hebrew 
studies that is congruent to the concept of Background in dis- 
course studies, although it comes from a different analysis of 
information structure and is broader.'® A thetic sentence ‘posits’ 
the whole predication without assuming a clear Theme—-Rheme 
structure (also called Topic-Comment, meaning pre-supposed/ 
established versus salient/non-established information). Proto- 
typically, a thetic sentence answers a question like ‘What 


happened?’, where the whole predication that follows is new 


17 Both Khan and van der Merwe (2020) and Atkinson (2021) link 
fronted non-focal fronted constituents to “thetic sentences.” 


'8 The idea of thetic sentences was first proposed in Linguistics by Sige- 
Yuki Kuroda (1972) and Hans-Jiirgen Sasse (1987). 
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information and thus there is little expectation of Theme—Rheme 
information structure. 

One may ask whether thetic sentences are driving the 
Hebrew word order or if, within Hebrew, the word order is sig- 
nalling something else that happens to be broadly congruent with 
what one might call thetic. A point to remember is that the 
fronted Contextualisation (Topicalisation) breaks any chain of 
wayyigtol clauses or weqatal clauses. In doing that, it is an iconic 
structure where the fronted constituent provides a discourse-level 
function of breaking some continuity that would otherwise have 
been implied by a wayyiqtol structure. This function of signalling 
discontinuity is acknowledged by Atkinson (2021, 119), who says 
that “the notion discontinuity correlates well with theticity, in that 
a new entity is introduced into the CG [i.e., common ground of 
shared understanding], or an inaccessible event which closes, 
transitions, or opens discourse units.” 

We can start to answer our questions from 83.1.5 about 
what is driving X-V sentence orders and what the structure is 
signalling by looking at a few of the prototypical situations that 
would produce a fully salient thetic information structure. Con- 
sider (18) and (19): 

(18) (a) AmArAn 
‘What happened?’ 
(b) axin wan 
‘Moab is in shame’ [V-S] (Jer. 48.19-20) 
(19) (a) 720270 MATA TAR’ 
‘And he said, “What happened, my son?”’ 
(b) TAN TwaAT Tw" 
‘And the herald answered and said,’ 
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(c)  omw)a 1D? ONT? 0: 
““Tsrael fled from the Philistines;’ [V—-S] 

(d) nya nny 77173 naan ON 
‘additionally, there was a great blow upon the 
people;’ [S—v]?° 

(e) 9 Pai yaN wn} 32 -W-oN 

‘additionally, your two sons died, Hophni and 

Phineas,’ [S—V] 

(anes oaoNn TN 

‘and the ark of the Lord was taken.”’ [S-V] (1 Sam. 

4.16-17) 
In the two examples above, an answer is given to a ‘What 
happened?’ question. Both answers could be called thetic from 
an information structure perspective, but structurally they use a 
V-S order. 

These answers could also be called ‘event-central thetic 
sentences’ (Khan and van der Merwe 2020, 361; Sasse 1987, 554) 
as opposed to ‘entity-central’. Khan and van der Merwe (2020, 
361) propose such an example after 737 in (20), but the context 
is problematic: 

(20) (a) Anwy nn 
‘What did you do?’ 
(b) P28 TAN NATTA 
‘Look, Abner came to you.’ [V-S] 


19 The first clause answered ‘what happened’ with V-S order. The 
following three clauses all avoid thematic (‘sequential’) verbs by using 
X-V order. This becomes a list of unordered events rather than a BH 
narrative. See below for discussion on potential rhetorical effects. 
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(c) qn 72m inndw arna? 

‘Why is this that you sent him away and he went off?’ 

(dq) Aa TAN"NY AYT? 

‘You [well] know (gaia, Abner, son of Ner,’ 

(e) xa qnh?°2 

‘that to deceive youjgoarrocusy He came!’ (2 Sam. 3.24- 

25) 
The context in (20) revolves around Abner. More importantly, 
the time of the 737 clause is past and we are seeing the most 
unmarked word order for a past/qatal reading, just like examples 
(18) and (19). An S-V order 81 731x* in (20b) would have been 
normal for a participle and a present tense, but the morpho- 
logically ambiguous x21 is clearly past qatal. In other words, it is 
likely the time of qgatal versus qotel that is affecting the word 
order in (20b), and not the kind of theticity. 

It appears that, elsewhere, fronted constituents are doing 
something other than, or in addition to, marking theticity.”° If 
theticity were semantically defined according to information 
structure, then it would already be the signal. Consider (21): 
(21) (a) ODN wR OW>AWE PPT dR 1d 

‘And they went to the city where the man of God was’ 

(b) Wyn nbyaa DYY nT 

‘and they were going up to the city’ [S—Vpanicipie] 
(c) oD akw> nixy’ ninyi Nya 7AM 
‘and theyjs topic) found girls going out to draw water,’ 


2° Despite the notional definition, Khan and van der Merwe (2020) focus 
on X-V structures, so that they are in broad agreement with this essay. 
Their thetic sentences + Topic sentences would equal sentences with a 
Contextualisation (Topicalisation) structure in my discussion. 
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(d) ia? aN 
‘and they said to them,’ 
(e) AIDA. wD 
‘““Ts there a seer in this place?” (1 Sam. 9.10-11) 

The description in (21b) and (21c) is normal BH for marking 
simultaneity.21 Two ‘unnecessary’ pronouns were generated, 
apparently in order to block a wayyiqtol clause. Throughout the 
context, the same subject is being followed (Saul and his helper) 
without change or Focal saliency. As a follow-up sentence to 
what the participants were doing, it would normally be called a 
categorical sentence within the thetic/categorical dichotomy. But 
what is the ‘artificial Topic’ doing in (21b) and (21c)? It is not 
part of a sentence with equally presentative information struc- 
ture. It actually has a fairly clear Topic and Comment differentia- 
tion that is associated with ‘categorical sentences’, so it is not a 
thetic sentence in that sense. One might argue that it is a special 
thetic by presenting a Background sentence that serves as a new 
setting for the following events. Khan and van der Merwe (2020, 
379) call the sentence 1xyn nan ‘and they found’ in (21c) a 
“resultative situation.” 

But it is simpler to recognise the simultaneity in time sig- 


nalled by the discontinuity of the CC and lack of wayyiqtol.”? This 


21 Fassberg (2019, 85): S31 xwi + Spia-xwii mana ‘with a structure 
“subject-verb” + “subject-verb”’. For additional examples, see Gen. 
29.9; 38.25; Judg. 15.14; 1 Sam. 7.10; 1 Kgs 1.14; Isa. 25.24; Job 1.16. 
2 Contrast the participle + wayyiqtol in 1 Sam. 6.13: on¥p waAW nvr 
TARAS INP) OPPY"NN Nw pava oven-yyp ‘and BetShemesh [were] 
harvesting wheat in the valley and they lifted their eyes and saw the 
ark’. The participle presents the setting and the wayyiqtol advances the 
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is true whether in a past narrative or future description, as in the 
following example. 
(22) (a) J2AToY ow Mata TT Aan 
‘And then, you will be speaking there with the king’ 
(b) JUN NiaN INI 
‘and I will come after you’ 
(c) TAN *nxdn 
‘and will confirm your words’ (1 Kgs 1.14) 
It would not be helpful to talk about the simultaneous entrance 
of the prophet as a resultative situation. The relationship between 
the participle clause and the finite verb clause is the same in (22) 
and (21). In (22), the participle is describing an ongoing progres- 
sive imperfective situation in the future, then the prophet will 
enter at that moment and complete the discussion with the king. 
The simultaneity is signalled by the CC, which prevents the use 


of the foregrounding weqatal.”? 


story with a foregrounded event. However, the wayyiqtol does not have 
the effect of ‘just then’ that comes with the discontinuity of the CC + 
qatal. 


?3 This is described concisely by Fassberg (2019, 84, translation mine): 
“The storyline in the Bible advances through continuities of verbs that 
are connected by conversive-waw. The biblical story teller deviates from 
the continuity (qatal...) wayyigqtol... wayyiqtol and from the continuity 
(yigtol...) weqatal... weqatal when he wishes to signal to the reader that 
this does not deal with a direct development in the story line. The sign 
for stopping the continuity is placing a word before the verb and the 
presentation of the verb without conversive-waw.” 
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3.1.7. Pragmatic Effects of Contextualising Constituents 


Practically, the processing for communication follows a simple 
order. If a constituent is placed before a finite Verb, the author 
has provided a signal for pragmatic effects.** The audience is led 
to recognise and interpret a special signal. If there is a Focus into- 
nation, it is interpreted as specially marked salient information. 
Without Focus intonation, the fronted constituent is a Topic/ 
Contextualisation structure that simultaneously breaks any 
wayyiqtol chain. The signal would be easy to perceive, although 
its application would be multivalent. 

There are many examples where a Focus interpretation is 
ruled out and yet an analysis as a prototypical Topic may not be 
readily apparent: 

(23) (a) inWN TINY YT OTNT} 
‘And the man knew Eve his wife;s_,’ 
(b) teeny 2m 7am 
‘and she conceived and gave birth to Cain’ 
(c) MATHS WROD WANN 
‘and said, “I have acquired a man with the Lord.” 
(Gen. 4.1) 
The Subject 07x7 ‘the man’ is not an extended topic of this verse 
or the following verses, nor does this Subject provide any natural 
framework for these immediate verses. In terms of information 


structure, the whole sentence posits a new situation.” The 


4 Conjunctions are not included in this statement, as they precede the 
clause in their own special position. 
5 It would probably rightly be classed as thetic by Khan and van der 
Merwe (2020) and Atkinson (2021). 
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Subject and Object are equally re-established in this sentence, 
which serves as a setting or starting point for the story that fol- 
lows. The Subject is not marked, salient information, so it is not 
a Focus in FG terms. But in what sense should 0x7 be called 
Topic? Why didn’t the author use inwx 7In nx OTNA YT", which 
would be more of a default for Hebrew narrative? 

This appears to be an example where a Topic/CC is used to 
set off the whole clause, and where the pre-verbal Topic/CC 
prevents the thematic verb structure (wayyiqtol). Within BH, the 
marked constituent simultaneously breaks the continuity of the 
thematic verb system (sometimes called sequential or consecu- 
tive verbs).”° The Topic/CC sets off the whole sentence with prag- 
matic marking, and iconically signals some sense of a break in 
continuity. Consider (24): 

(24) (a) oa nditenn Soa AI 
‘And the Lord threw down,;_y, a big wind to the sea’ 
(b) pea DitaAvO AN 
‘and there was a big storm in the sea’ 
(c) TAWA? NIN TNT 
‘and the boat planned |; .,, to break apart’ (Jon. 1.4) 
The text in (24) shows a similar problem to (23). The Subject 
comes before the Verb, but it does not make a transparent Topic 


for the following verses. It is not a Focus, either, since ‘throwing 


6 These verb categories of wayyiqtol in the past/realis/perfective and 
weqatal in the future/irrealis/imperfective are often called sequential 
verbs. However, they do not uniquely entail strict temporal sequential- 
ity, nor do they induce their Tense-Mood-Aspect from a previous verb, 
so a more abstract term like thematic may be preferable as providing a 
term for unit structure and for foregrounding. 
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down a wind’ is really the newest and most salient information 
in the story at this point. The sentence is not answering an 
implied question (‘who threw down a wind?’), but is presenting 
the wind as a new development, a new setting or scene, but 
without using the normal wayyiqtol syntax (... 717" Dv"). 

These examples, as well as hundreds more, lead to a recog- 
nition that a non-Focal pragmatic positioning can also mark the 
whole clause as a pragmatic signal for the larger discourse and 
not just for the constituent itself. For example, Rashi, one of the 
outstanding medieval commentators, suggested that the word 
order at Gen. 4.1 was used to mark a break in the timeline of the 
story and the conception (and maybe the birth) took place while 
the couple was still in Eden (see Rashi 1986, 68). Such a sugges- 
tion is possible because choosing a pragmatic function that uses 
fronting prevents the use of the most common narrative struc- 
ture, wayyiqtol (the thematic past-tense verb). Thus, the prag- 
matic positioning of pieces of a Hebrew clause is directly linked 
to the verb system. 

Hebrew is a language with an interesting information pack- 
aging strategy. For over 1,000 years, the language used two bina- 
ry, finite verb systems: qgatal (past-realis-perfective) versus yigtol 
(future-irrealis-habitual) on the one hand and wayyigqtol (past- 
realis-perfective) versus weqatal (future-irrealis-imperfective) on 
the other hand, the latter being a ‘thematic finite verb’ system. 
The wayyiqtol thematic verb system provided a packaging struc- 
ture in narrative and past contexts, and the weqatal provided a 


similar structural packaging in future contexts, volitional contexts, 
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and as continuing habitual sentences in the past.”” The thematic 
verb system requires a V-S order. By contrast, the non-thematic 
system has an X-V order. This means that if an author wants to 
use a non-thematic verb and break a thematic sequence, he must 
use a pragmatic constituent between a conjunction and the verb. 
In other words, a Topic (CC) may be used to create a non- 
thematic sentence, as well as to mark a Topic constituent. 

Such sentences may be used in multiple ways. In their most 
basic sense, they break the flow of the thematic sentences 
(wayyiqtol and weqatal). This structure is chosen to signal a break, 
and the break may be (a) a break in time (whether simultane- 
ously or as a flashback); or (b) a break in structure (whether a 
unit boundary with a transitional setting or a parenthetical com- 
ment); or simply (c) a break for rhetorical-literary effect. Both 
(23) and (24) can be interpreted as unit boundaries in the dis- 
course, as the story time advanced forward in each. Similar situa- 
tions occur in the following exmaples. 

(25) (a) DaTaN"Ny 1DI OANA PRA OITA ANS aN 
‘And it happened after these events and God tested 
Abraham,s.y;’ (Gen. 22.1) 

(b)  omas ny NA OM APRA ONITD ANN eT 

The word order in (25a) is S-V, where the Subject does not 


appear to be a Focus as the most salient information, nor is it 


27 With a binary verbal system, it should be assumed that the description 
here is necessarily simplified. A binary tense-mood-aspect finite verb 
system must describe most of the distinctions in the world of 
communication, including time, modality, and aspect, with only two 
verb forms. 
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readily explained as a Topic. Rather, the whole sentence is set 
apart, and here it appears to be like a section heading that 
summarises the following story in 22:2-16. The V-S word order 
in (25b) would make use of the thematic verb system and would 
imply that this is the first foregrounded event that develops the 
story proper after the time setting ‘and it happened after these 
events’. But by using S-V order with a Topic/CC, the author was 
able to signal a discontinuity, setting the whole first sentence 
apart and presenting a summary of the whole story. 

Such a summary sentence can be understood as one of 
many kinds of literary settings before the development of the 
story, as in (26): 

(26) (a) AIAN AMA pay 
‘And the earth was empty chaos’ 
(b) oinn-e-by wn 
‘and darkness [was] on the surface of the deep water’ 
(c) onary nant on>y nm 
‘and God’s spirit was hovering over the water surface’ 
(d) tiga ody TAN 
‘and God said “Let there be light,” 
(e) Tigo 
‘and there was light.’ (Gen. 1.2-3) 
In (26), lines (a) through (c) set the stage for the main event in 
(d) and (e). The S—V order in (26a) avoids a thematic verb and 
avoids implying that this is the first main event of the narrative. 
The whole sentence becomes a setting for the story that follows. 
Likewise, the verbless clause and the participial clause in (b) and 
(c) continue a description of the background setting. They do not 


advance the events of the story, so they do not use the wayyigqtol 
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structure. We thus find a satisfying reading of the authorial 
choice of S—V order in (26a). 

The example in (27) provides a similar case: 

(27) nM qn THA AD" 

‘And Joseph had been taken,s_y, to Egypt’ (Gen. 39.1) 

This verse returns the story in Genesis to Joseph and Egypt after 
a long, multi-generational hiatus in chapter 38. The S-V order 
might be called a Topic in order to return to discussing Joseph. 
However, it also blocks the thematic, wayyiqtol verb system and 
repeats the action that had already been narrated at Gen. 37.28 
and 36. Therefore, the sentence as a whole marks a chronological 
reordering in the story, conveying a pluperfect sense to the 
audience. 

In (28), we might try to explain the S~V word order as a 
Topic/CC in order to compare the sailors’ action and Jonah’s 
action. 

(28) oT Aw APNEA NaP-OKX TH AD" 

‘And Jonah went down,,;,, to the back of the ship and lay 

down and went into a deep sleep’ (Jon. 1.5b) 

However, one should consider that Jonah went below and went 
to sleep before the storm and the actions of the sailors. The choice 
non-wayyiqtol sentence structure and the whole sentence becomes 
a potential break in the timeline of the story. Here, the time most 
probably goes back to an earlier point in the story. 

From the context around (29), we know that Jacob left with 
the livestock, along with Rachel, at a time while Laban was 
already away (Gen. 31.17, 20). 
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(29) (a) qwIa AIS PAX poynde Kind... IPA ATNR 3AM 
‘And he [Jacob] led all his livestock ... to Isaac his 
father in Canaan.’ 
(b) ian tha qn 1291 
‘And Laban had gone,,_,,; to shear his sheep’ 
(ce) Danan ONT 2am 
‘and Rachel stole,ys; the house-gods.’ (Gen. 31.18- 
19) 
It appears, then, that the Subject-Verb order 79 7271 in (29b) is 
not used as a Focus or Topic for ‘Laban’, but to avoid the impli- 
cations of 29 77* ‘and Laban went’. That thematic verb would 
normally have sounded as if Laban’s visit to his sheep was after 
Jacob and Rachel started their journey. But then Rachel could 
not have stolen the idols afterwards (133m), because she had 
already left the homestead with Jacob. The S-V order 79a 721 
provides a signal to the reader that a break occurred and the 
reader can immediately apply this to a prior time for Laban’s 
business trip. 


3.1.8. Dramatic Pause as a Literary Effect 


A remarkable string of S—V clauses appears in (30): 
(30) (a) R730 
‘The morning turned light;, ,,’ 
(b) oa ANI AAD ndw OWN 
‘and the men were sent offjsy,, they*® and their 
donkeys.’ 


?8 In FG, such an afterthought is called a Tail function. 
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(c) PATHS NY? DA 
‘They left the city;s.yj,’ 
(d) apna No 
‘they didn’t go far,’ 
(ec) imaoy Wy? Tox AM 
‘and Joseph said to his house-manager,,y;’ (Gen. 
44.3-4) 
The clause in (30a) has no conjunction and the pragmatically 
marked S—-V sentence might be taken as a setting and a unit 
boundary. However, the second clause in (30b) is also S-V, a 
seemingly unnecessary word order since o°W3xA in>wn* ‘and the 
men were sent off’ would have fit normal thematic verbs and 
would have been expected. The next clause in (30c) is also S-V, 
also without conjunction, and with the S—V order created by what 
might be seen as an unnecessary pronoun, since WyA-nsy ixy* 
would have worked fine. The next clause in (30d) is a negative, 
ineligible for a thematic wayyiqtol verb, and without a conjunc- 
tion. Finally, Joseph’s speech is introduced in (30e) with S—V 
instead of the usual AN". The lack of conjunction in (30a) and 
(30c) gives a sense of a double beginning, while the negative 
clause becomes a parenthetical comment. In parallel with this, 
the avoidance of thematic verbs, despite an appropriate context 
for them, does not allow the story to progress normally. All of the 
verb forms here ‘break’ the obvious forward implications of the 
real-time events. The literary scene appears to have stalled in 
literary time with all of the actions piled on top of each other. If 
that is the intended effect of the author, we may call this a rhe- 
torical use of the pragmatically marked sentences. The story has 


paused with a break extended literarily and syntactically, even 
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though the real time has moved forward.” In literary terms, we 
could call this a kind of dramatic pause, perhaps leading to some 
kind of climax. If so, the suspense is held for considerable time, 
since it is thirty-two verses later, at Gen. 45.3, that Joseph dra- 
matically reveals himself to his brothers. 

The idea of a literary and syntactic pause can explain the 
example above in (24c). One would have expected the thematic 
form 727) 773x7 awnnr* ‘and the boat planned to break apart’. 
However, the S-V order and the seemingly unnecessary CC 
(Topic) present a pragmatically marked sentence that breaks the 
forward movement of the unit and story, despite the real world 
advance in time. It rhetorically and syntactically pauses the story 
at a point of dramatic tension—a dramatic pause. The content, as 


well, leaves the outcome open-ended and on the verge of disaster. 


3.1.9. Multiple Pragmatic Constituents in Hebrew 


With multiple pragmatic constituents before the Verb, there is 
typically a scale from the more presupposed constituent(s), i.e., 
Topic/CC, to the most salient Focus. Consider (31). 
(31) yIx RX "pwWATD 
‘because MY SiNSjoqopic} 1 MySelfis focus) WOuld recognise’ (Ps. 
51.5 [7]) 
The syntactic Object of the sentence is pw ‘my rebellious sins’. 
Its position in the sentence can be explained as linking to the sins 


mentioned in other verses, that is, as a Topic function that uses 


2° See, for example, Esth. 7.6—-8 for a similar long string of non-thematic 
S-V sentences at a climax. Also, Gen. 19.23-25 in the description of the 
destruction of Sodom and Gomorrah. 
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established information to lead into the Comment and newer 
information of the sentence. Then, the explicit Subject ‘T’ is recog- 
nised as pragmatically fronted and as reinforced Focal infor- 
mation that is contra-expected by human nature. 

To have both constituents before the Verb demands an 
explanation in an FG approach to a V-S language. In the example 
in (31), the first is a Topic/CC and the second a Focus. 

(32) tm inv) ty nim 
‘the Lordycc:; will give strength;cc.} TO HIS PEOPLE ppocusy’ 
(Ps. 29.11) 
An FG approach provides for insightful reading. FG lets the audi- 
ence know that there are three pragmatically marked constitu- 
ents in (32), culminating in ‘to his people’. The Psalmist wrote 
this text to exhort the audience to remain with the people of the 
covenant, since that is where the blessings of their incomparable 


God rest. That purpose is reflected in the word order choices. 


3.2. Application to Greek 


In Greek there are structural, frequency, and complexity dif- 
ferences from Hebrew. Nevertheless, the principles of FG and 
marked information structure pay dividends for those reading 
Greek texts. We can illustrate how pragmatics in FG are helpful 
for Greek in similar ways, although many of the issues go beyond 


the scope of a simple article. 


°° ty ‘strength’ is also a word of praise from v. 1. There is a crescendo of 
saliency, leading from the less salient to the more salient. The final, 
most salient item would receive Focus intonation. 
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Several obvious differences from Hebrew can be stated at 
the start. While Hebrew builds texts with finite verb sentences, 
Greek uses much more subordination. Participial phrases hang 
on finite-verb sentences seemingly everywhere. Greek has a more 
granular connective system and uses many more particles in 
shaping and presenting a text. The verbal system in Greek is 
highly aspectual, requiring users to encode aspect in imperatives, 
subjunctives, optatives, infinitives, and participles, as well as 
with finite indicatives in the past.*! In word order, Greek has 
more flexibility, with words attracted phonologically to phrasal 
Heads (enclitics), phrases split and positioned differently (hyper- 
baton), flexible orders with Noun + Adjective, and more usage 
of pragmatic functions. However, those pragmatic functions of 
Topic and Focus provide much information to the practised 


audience. 


3.2.1. Greek is V-S—O 


Greek, like Hebrew, was a V-S language, as linguists working 
both within and outside of FG have concluded.” Here is a 


summary from Runge (2010, 189-90) in support: 


3! The binary verbal system in Hebrew was more time-mood orientated 
and had difficulty expressing aspect in comparison with Greek. See 
Hornkohl (2018); Buth (2019). 

2 Helma Dik (1995, 12) uses a template P1-P@-V-X. The P1 is prag- 
matic Topic, the P@ is Focus, followed by the syntactic template V—X. 
Note that the numbers on the P position are different from those used 
by Runge (2010), although they are referring to the same concepts. 
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Simon Dik has proposed that there are two different pre- 
verbal positions that may or may not be filled in any given 
clause, which he calls Position 1 (P1) and Position 2 (P2). 
... The meaningful distinction between P1 and P2 is based 
largely upon whether the fronted information is presup- 
posed or newly asserted, respectively. 


One or more established (i.e., topical) elements of the 
clause may be placed in position P1. These P1 elements 
establish a new frame of reference, creating an explicit 
mental grounding point for the clause that follows. Posi- 
tion P2, on the other hand, is where newly asserted or focal 
information is placed. 


Dik’s preverbal template is given by Runge (2010, 191) as “(P1) 
(P2) Verb X.” To this we may add another comment from Runge 
(2010, 207): 

Koiné Greek is a verb-prominent language, where the least- 

marked and most basic order of clause components is for 

the verb to be placed in the initial position. When other 


elements are placed in the initial position, such placement 
is motivated by some pragmatic reason. 


3.2.2. Application to Greek Texts 


The example in (33) provides a simple example to reinforce the 
claim that FG can help readers to recognise pragmatic signals 


and, thus, to enhance interpretation skills. 


°3 Here Runge footnotes and acknowledges other terminology that 
covers the same kind of pragmatic function: ‘topicalisation’ 
(Lambrecht), ‘point of departure’ (Levinsohn), ‘contextualising 
constituent’ (Buth). 
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(33) "Ev dpyf nv 6 Adyos, xal 6 Adyos Hv Mpds Tov Bedv, xat Beds Hv 6 

Adyos. 

‘In beginning ;ropicx) Was the Word, and the Word )ropic sub) WAS 

with God, and the Word was God rocus,comp)-, John 1.1) 

The whole sentence is technically new because it starts the book. 
However, the phrase év dpyj ‘in beginning’ is clearly fronted 
before the Verb as a non-Focal constituent, and therefore as a 
Topic/CC. It turns out to be the most presupposed information in 
its clause and provides a framework for the rest of the sentence, 
even though it is not the larger topic of the sentence. The clause 
is a Setting, Background, and Thetic. That is why a name like 
‘Contextualising Constituent’ for the fronted constituent is more 
transparent in English than ‘Topic’ (cf. Runge’s ‘Frame of refer- 
ence’ above). Syntactically, év dpy7 ‘in beginning’ is an adverbial 
complement to the Verb, something labelled ‘X’ in many of the 
templates above. This X has been chosen to function as a broadly 
defined Topic or CC. In addition, the Subject, 6 Adyos ‘the word’, 
follows the Verb and suggests that Greek is a real Verb-Subject- 
Object language. (Obviously, a sample of three clauses can only 
be an illustration, not a substantive argument.) 

In the second clause, the phrase 6 Adyos ‘the word’ can be 
interpreted as a Topic/CC that marks and establishes the Subject 
as the primary topical link of the clause to the sentence. The 
‘word’ is not the new or most salient information of the clause 
and it is not to be read as Focus or with Focal intonation. 

Finally, the phrase §¢d¢ ‘God’ is the new, salient information 
of the clause and it comes before the default V-S order, so it is 
clearly marked as Focus. Again, the order of the Subject supports 


the idea that this is a V-S—O language and not S-V-O. We can try 
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to paraphrase the pragmatic signals of the three clauses in 

English: ‘In the beginning the Word existed. (Let's establish ‘the 

Word’ as our wider Topic): it was with God, and it was, in fact, 

God.’ Presumably, the intonation of the Greek sentence would 

reinforce this reading and make the differences between the CC 

and the Focus transparent. All of this can be processed rapidly 
and subliminally by an accomplished reader or audience. 

These verses in John are uncharacteristically simple for 
Greek, but that makes them good for illustrating how FG allows 
a speaker to encode word order and for an audience to interpret 
the text with the clues of the word order. The next example in 
(34) illustrates both the application of these functions and poten- 
tial ambiguities. 

(34) obtos jv ev dpyh mpds tov Gedv. mavta OV adtod éyéveto, xal ywpls 
abtod éyéveto ode Ev 6 yéyovev. év abt Con qv, xalt } Con Hv TO 
his Tay avOpwmwy: xal TO his ev TH oxotla haivel, xal y cxoTla 
avto ob xatéAaBev. 

‘This;ropicsus) Was in the beginning with God. All 

things ;ropicsupy through him(po-usx; came to be, and outside of 

him/ropic.xj Came to be nothing that came to be. In himpropiexy 
was lifesrocussub/comp)» ANd the liferropicsup) Was the light of 
people, and the lightyropicsub) Shines in the darkness pocus.xj 
and the darkness;zoyi-sup) did not comprehend itjrocusr,onj}” 

(John 1.2-5) 

Every clause makes use of pragmatic functions to help the audi- 

ence follow the thoughts and grasp the points. 

The first clause has a Topic otros ‘this’ that restates items 
from the first verse, and the sentence re-establishes the starting 


base for the points that follow. The second clause may be read in 
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two ways. As a Topic, mdvta ‘all things’ would introduce items 
that are assumed to be part of the beginning. The salient, 
important point in the clause would be that they came into being 
through the Word.* That salient point ‘through him, the Word’ 
was placed before the Verb and given prominence as Focus as 
being contra-expected. A reader/listener could distinguish these 
Topic and Focus placements through the use of a Focus intona- 
tion. On the other hand, mavta ‘all things’ is technically newer 
information than the pronoun ‘through it/him’ and it could have 
been intended for Focus intonation with ‘through it/him’ remain- 
ing as a Topic. Such an order of Focus-Topic is rare.* In the next 
clause, the phrase ‘outside of him’ serves as a Topic that reiterates 
the idea of the previous clause with default word order. 

The Word continues to serve as a pragmatic Topic ‘in him’, 
and the clause introduces marked (i.e., fronted), salient infor- 


mation, ‘life’, as Focus before the Verb. This begins a chain of 


34 | have translated the pronoun as ‘him’, even though it refers to an 
abstract noun. That noun isn’t fully personalised until vv. 10-18. 


*° A fairly unambiguous example of Focus before marked Topic occurs 
at Gal. 3.2, é& gpywv véuou td mvetua eddPete  && dxohe miotews; ‘Did you 
receive the spirit from works of law or from hearing of faith?’ The 
phrases é& Zpywv véuou ‘from works’ and 2& axo%¢ miotews ‘from hearing of 
faith’ are the Focal question constituents. However, ‘from works procs)’ 
comes before the Topical Object 1d mvetiua ‘Spiritiopject topic)» See also 
John 5.46, ci yap émotevete Mwicel, emotevete dv éuol mept yap éuot exetvoc 
éypapev ‘for if you believed Moses, you would have believed me, 
because hejsupropic) Wrote about Mejx Focus]. SO Tavta is functionally 
ambiguous and could have been read with two intonations. 
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clauses in which any salient information becomes a marked Topic 
in the following clause. 

One tricky point of reading comes in the last clause. There 
may be two marked Topics in the clause ‘and the darkness, it, did 
not comprehend’. Both ‘darkness’ and ‘it [the light]’ appear to be 
pre-supposed, established, topical constituents and may not 
receive any special reinforcement, therefore without focal intona- 
tion, leaving the negated Verb as most salient. So, although there 
are two pragmatic constituents before the Verb, there is no Focus 
constituent marked by word order. Alternatively, ‘it? may be read 
with Focal intonation for a coming contrast with ‘him’ in verse 
10. 


4.0. Prospects for Further Application 


Functional Grammar helps identify pragmatic markings in a lan- 
guage. Learning to identify pragmatic material in texts is a gate- 
way to a wider world, whether in linguistic, literary, or biblical 
studies in particular. 

Some researchers may want to work directly in the field of 
formalising grammar and knowledge about human language and 
the iterations that are found in Biblical Hebrew and ancient 
Greek. Many questions remain for formalising syntax. This is true 
for linguistics in general and for FG. Areas like the linguistic side 
of FG, word order, or discourse analysis and pragmatics in either 
Hebrew or Greek continue to develop. There is a continuum from 
studying how stories are constructed all the way down to the nuts 
and bolts of a language. Functional linguistics is poised to address 


that challenge. At the end of the day, these are tools that are used 
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for describing and understanding messages in both their structure 
and their meaning. Research and understanding in both Hebrew 
and Greek literatures are enhanced as one develops a sensitivity 
to the pragmatic signals and language choices that are woven 


into every sentence in a text. 
5.0. Further Reading 


5.1. General Introductions 


Dik (1980, 1-24) 

Anstey (2004) 

Kees and MacKenzie (2015) 
Kees and MacKenzie (2008) 


me Re Be 2S 


5.2. Foundational Texts (Greek-Orientated) 
1. Dik (1995) 
2. Runge (2010) 

5.3. Foundational Texts (Hebrew-Orientated) 


1. Moshavi (2010) 
2. Hornkohl (2018) 
3. Atkinson (2021) 
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COGNITIVE LINGUISTIC THEORY AND 
THE BIBLICAL LANGUAGES' 


William A. Ross 


Study of the Bible has always involved study of its languages. But 
the study of language has not always involved linguistic theory 
as such. It has only been over the past sixty years or so that 
western biblical scholarship has begun to appropriate and apply 
insights from general linguistics to better understand the ancient 
texts. Over that time, the bulk of linguistically -informed biblical 
language research has adopted formalist linguistic theories, such 


as structuralism or generativism.* This trend is due in part to the 


‘I am grateful to Elizabeth Robar, Travis Wright, and my two peer 
reviewers for their useful input on earlier versions of this chapter. My 
thanks go to Joey Hyatt for his capable research assistance preparing 
the bibliography. 

? Taylor (2002, 4-5) concisely explains formalist approaches as those 
that “regard a language as a self-contained system, whose properties are 
encapsulated in a Grammar, i.e. a device which generates, or defines, 
the set of well-formed sentences which constitute the language. A 
general feature of formalist approaches is to regard a language as a 
disembodied object, which is independent, as it were, of the speakers 
who use it and the purposes for which they use it.” More specifically 
with reference to mainstream generativism, ‘formal’ tends to mean 
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fact that formalist theories are older and better known in com- 
parison with the alternatives. But it is also due to the tendency 
of interdisciplinary research never to be truly up-to-date on all 
fronts. That being the case, it is only in the last twenty years that 
biblical scholarship has come into direct contact with Cognitive 


Linguistics, a topic that is the focus of this chapter. 


1.0. The History of Cognitive Linguistics 


It is important to understand at the outset that Cognitive Lin- 
guistics is not a unified field of research. It is, as Geeraerts 
(2006b, 2) puts it, “an archipelago rather than an island,” one 
whose members are described below (cf. Geeraerts and Cuyckens 
2007a, 5-6). Even so, Cognitive Linguistics is a research perspec- 
tive that is characterised by several key theoretical commitments 
concerning language, and these commitments do grant coherence 
and unity to the discipline as a whole, if not total uniformity. As 
discussed further below, it was not until the 1970s and 1980s 
that major developments—and controversies—in general lin- 
guistics at large created the necessary conditions from which the 
foundational pillars of Cognitive Linguistic theory would begin 
to emerge. But to understand why and how that happened 
requires a broader historical perspective than one might expect. 

Cognitive Linguistics is not unique among theories for its 
focus on language as a mental phenomenon. Generativism, for 


example, is also ‘cognitive’ in a basic sense, in that it attributes a 


explicit, non-embodied, and rationalist, which makes semantics describ- 
able with abstract, rule-governed predicate logic. 
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mental status to language.’ But Cognitive Linguistics does rep- 
resent the revival of interest among linguists in the meaning- 
bearing function of language for communication alongside its 
psychological character. In other words, Cognitive Linguistic 
theory is interested not only in knowledge of a language, but also 
in language as a form of knowledge, even an “integral part of 
cognition” itself (Janda 2015, 131). For most of the twentieth 
century, this kind of outlook had receded entirely from view in 
linguistic theorising, in favour of an almost exclusive focus on 
extrapolating abstract structures or rules taken to account for 


language as a self-contained system. But it was not always so. 


1.1. Early Modern Foundations 


Linguistics emerged as a scientific discipline—if one wishes to 
use such terminology—in the nineteenth century, as it became 
distinct from the older and broader practices of philology.* This 


was the era of diachronic, comparative linguistics.” In many 


3 Hence the capitalisation of ‘Cognitive Linguistics’ as a point of distinc- 
tion from other theoretical approaches that also consider cognitive 
aspects of language (Siewierska 2013, 485 n. 1). 


* Linguists themselves disagree over whether and how linguistics can be 
considered a ‘science’, properly speaking. Harris (1993, 11) satisfyingly 
dismisses such bickering, saying that linguistics “is some sort of system- 
atic, truth-seeking, knowledge-making enterprise, and as long as it 
brings home the epistemic bacon by turning up results about language, 
the label [‘science’] isn’t terribly important.” 

° Diachronic study looks at the development of phenomena over time, 
as compared with synchronic study, which looks at phenomena within 
a specific point or period of time. 
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ways, this phase of language study and its particular methods 
arrived upon the broader tide of historical reconstruction and 
comparativism that was rising throughout the academy at large 
in this period (De Maurio and Formigari 1990; Bod 2013, 143- 
83). Scholars tirelessly sought parallels in sound-meaning pairs 
as a way to chart the history of what would come to be known as 
the Indo-European language family. 

Although many of their conclusions have stood the test of 
time remarkably well, nineteenth-century comparative linguists 
were also prone to indulging in wild speculation (see Eco 1995). 
These scholars were heirs to the old notion that languages are 
linked to the thought patterns of the people who use them 
(Robins 1997, 152-206). Some took this idea down dark, more 
deterministic pathways in the intensifying atmosphere of Roman- 
tic nationalism in fin de siécle Europe (see Olender 1992; Turner 
2014, 125-46; Joseph 2020, 145-63). But at a general level, most 
scholars viewed the study of language as concerned with com- 
munication and also as a corollary in some way to the study of 
the mind and therefore to all of human society. Language was 
understood as essentially psychological and thus imbricated with 
human experience (Campbell 2003, 93-94; Geeraerts 2010, 9- 
16). 


1.2. Formalist Peregrinations 


The study of language changed shape at the turn of the twentieth 
century. Perhaps the most important bellwether was the post- 
humous appearance of the Cours de linguistique générale by 
Ferdinand de Saussure (1857-1913) in 1916, which set out to 
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define the task of linguistics. Saussure’s work had lasting effects. 
One was to reorientate linguistic inquiry away from diachronic 
questions of historical reconstruction, which had predominated 
previous scholarship, towards synchronic questions. Another was 
to move away from viewing language as a social phenomenon. 
Instead, Saussure distinguished linguistic behaviour (parole) from 
the supposed abstract system underlying it (langue). Only the 
latter was the proper object of linguistic research, understood as 
a self-contained system of interrelated signs. 

This basic outlook became the foundation of structuralism 
as it took root in America in the work of Franz Boas (1858-1942), 
Edward Sapir (1884-1939), and especially Leonard Bloomfield 
(1887-1949). It was the latter in particular who dispensed with 
the mind as entirely irrelevant to linguistic inquiry and descrip- 
tion. Bloomfieldian structuralism as it was articulated in his 1933 
Language was highly systematic and method-orientated, position- 
ing itself as having no overlap with other disciplines. Behav- 
iourist theory helped justify the anti-psychological posture of 
structuralism, which instead focused on creating mechanisms to 
empirically verify linguistic descriptions of phonology and mor- 
phology. Ultimately, structuralism came to be entirely about the 
signifier but not at all about the signified; always the winter of 
grammar but never the Christmas of meaning. The latter was 
messy and much better left to the psychologists or sociologists 
(Bloomfield 1933, 140; cf. Harris 1993, 16-28; Robins 1997, 
222-59; Campbell 2003, 95-100). 

The face of linguistics changed again in the mid-twentieth 


century as the empiricist outlook of structuralism began to give 
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way to the new theoretical paradigm developed by Noam 
Chomsky (1923-), the intellectual progenitor of transformational 
(later generative) grammar. The history and key commitments of 
generativism are recounted elsewhere in this volume (Naudé and 
Miller-Naudé) and need not be given in any detail here (see also 
Blevins 2013; Freidin 2013). But with Chomsky, linguistics 
became rationalist once again, turning attention back to the 
relationship between language and the mind. In broad strokes, 
Chomsky’s approach focused on syntax and aimed to articulate 
the formal principles that describe how the mind of a speaker 
generates grammatical sentences given the parameters of a spe- 
cific language. These principles were understood as being both 
universal across all languages and innate in the human mind. 
Generativism remained the predominant linguistic framework for 
most of the twentieth century, certainly in North America and in 
many cases elsewhere, until a theoretical parting of ways 
occurred in the 1980s (Robins 1997, 260-63; Campbell 2003, 
100-03).° 


1.3. Cognitive Realisations 


The advent of Cognitive Linguistics is linked with a broader 
movement away from formalist theories that is known as func- 
tionalism (see Buth in this volume). Although generativism is 
interested in cognitive aspects of language as a system for 


expressing thought, it also gives little to no attention to the 


° It is probably correct to say that generativism is still the dominant 
linguistic framework today, although only in certain areas, especially 
syntax. 
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communicative purpose of language, and intentionally so.’ This 
orientation is not unique to generativism but can be traced back 
to Saussure’s disinterest in parole in favour of langue. By contrast, 
functionalist theories focus on language as a means of commu- 
nication and how grammar arises from use of the language. Func- 
tional and Cognitive Linguistics have distinct but very much 
overlapping histories, such that linguists disagree as to whether 
and how the latter is properly understood as part of the former 
(see Van Valin Jr. 2003; Nuyts 2010). Still, the two labels identify 
a set of theoretical frameworks that are rightly understood as 
fellow travellers on a road that has departed from the highway 
of generativism. 

That departure was complete by the late 1980s, after over 
a decade of increasingly heated controversy within Chomskyan 
circles over theoretical developments known as Generative 
Semantics.® Certain participants in the debate were increasingly 
dissatisfied with the level of abstract restrictiveness that genera- 


tivism had reached. In contrast to this formalism, the Cognitive 


7 In the words of Chomsky (2002, 76-77): “Language is not properly 
regarded as a system of communication. It is a system for expressing 
thought, something quite different.” Another example is Chafe (1994, 
8; quoted in Ariel 2010, 76), who states that “there are many important 
things about language that can never be understood by constructing 
sequences of words that begin with John and end with a period, and 
asking oneself whether or not they are sentences of English.” Cited in 
Ariel (2010, 76). My thanks to Travis Wright for drawing my attention 
to these quotes. 


8 See the lively history by Harris (1993), now updated in an excellent 
second edition (2021). 
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Linguistic and functionalist approaches agreed that language is 
not an autonomous mental system but rather is integrated with 
human cognition as a whole.’ One of the leading figures advanc- 
ing this idea in the Cognitive Linguistics movement was George 
Lakoff (1941-), who in 1987 published one of the seminal texts 
in Cognitive Linguistics, entitled Women, Fire, and Dangerous 
Things. It was in that same year that Ronald W. Langacker 
(1942-) published Foundations of Cognitive Grammar, which 
would become another foundational text for Cognitive Linguis- 
tics, alongside The Body in the Mind by Mark Johnson (1987). As 
discussed in the next section, these texts and others soon to fol- 
low set out key theoretical commitments, largely in direct con- 
trast to generativism, that make Cognitive Linguistics uniquely 
centred on the nature of linguistic meaning as part of human 
cognition as a whole. Today, Cognitive Linguistics is widely rec- 
ognised as one of the major frameworks in theoretical linguistics 
as a discipline, and it continues to grow in popularity and appli- 
cation in numerous venues around the world (Taylor 2007, 566- 
71, 574-76; Nerlich and Clarke 2010, 590-92; see also Howe and 
Sweetser 2013, 123-24).?° 


° In generativism specifically, language is handled as a separate module 
within the human mind that is unconnected from other cognitive abili- 
ties, hence it is fundamentally disembodied. 


© For a more in-depth discussion of the origins of Cognitive Linguistics, 
see Ross (forthcoming). 
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2.0. Key Theoretical Commitments and Major 


Concepts 


Cognitive Linguistics as it has developed over the last forty years 
is focused on understanding the communicative function of lan- 
guage, specifically in terms of the experiential basis for struc- 
tured relationships that exist between language and cognition. At 
a basic level, Cognitive Linguistics views language as a dynamic 
but shared repertoire of form-meaning pairings used to symboli- 
cally encode and transmit thought to others. Language in this 
sense involves a “repository of world knowledge, a structured 
collection of meaningful categories that help us deal with new 
experiences and store information about old ones” (Geeraerts and 
Cuyckens 2007a, 5). 


2.1. Four Commitments 


Four key theoretical commitments characterise Cognitive Lin- 
guistics as a whole and help bear out its view of language. The 
descriptions of these commitments below are brief and abstract, 
but are illustrated further below with the six concepts that flow 


out of them. 


2.1.1. Language Arises from Embodied Cognition 


Cognitive Linguistics hypothesises that the human mind has no 
autonomous or innate faculty of some kind where language pro- 
cessing occurs, separate from other cognitive processes, which is 
a basic assumption in Chomskyan generative grammar. Instead, 
Cognitive Linguistics maintains that linguistic knowledge is rep- 


resented in the mind and processed in basically the same way as 
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all other conceptual structures. Linguistic knowledge—the pair- 
ing of form and meaning—is therefore conceptual, an integral 
part of cognition in general, and organised and governed in the 
same ways as the cognitive abilities that are applied in other 
bodily tasks such as visual perception and sensorimotor activity. 
Language is distinguishable as a cognitive ability, but it is not 
unique in terms of the mental processes that are involved. An 
important corollary of this view of cognition is that linguistic 
meaning itself is embodied—not purely rational—since it reflects 
human experience in the world (Croft and Cruse 2004, 2-3; 
Geeraerts 2006b, 4-5; Janda 2015, 132-33)."! 


2.1.2. Language is Perspectivised 


A fundamental commitment of Cognitive Linguistic theory is that 
conceptual structure—and thus also linguistic knowledge—is not 
reducible to mere truth-conditional correspondence to the exter- 
nal world. Rather, language bears meaning because it construes 
the world in a perspectivised, embodied way, imposing a struc- 


ture upon it rather than just reflecting objective reality.’’ So in 


' For example, in his discussion of the word grasp from a cognitive 
scientific perspective, Feldman (2008, 166) explains how “the embod- 
ied neural approach to language suggests that the complex neural 
circuitry that supports [the physical action of] grasping is the core 
meaning of the word.” 


12 A basic example would be the word sunset, which only bears meaning 
(indeed only exists within language) because of the physical organisa- 
tion of external objects relative to human perception. Note that this 
commitment does not entail an endorsement of philosophical relativ- 
ism. From its earliest stages, theorists within the Cognitive Linguistic 
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Cognitive Linguistics, conceptual structure is likewise subject to 
construal in its organisation and how categories are formed. To 
articulate this commitment another way, grammar is conceptu- 
alisation, since linguistic structure reflects conceptual structure.'* 
Cognitive Linguistics thus maintains that language provides vari- 
ous ways of portraying and profiling the information being com- 
municated (Croft and Cruse 2004, 1, 3; Geeraerts 2006b, 4; Evans 
and Green 2006, 40-43). 


2.1.3. Language is Symbolic 


Cognitive Linguistic theory emphasises the primacy of meaning 
in linguistic analysis by assuming that the basic function of lan- 
guage is to express thought and therefore must involve meaning, 
including pragmatic meaning. The way language expresses 
thought is by using symbols, which consist of forms—whether 
spoken, written, or even signed—and meanings with which the 
forms are paired by convention. Linguistic symbols bear meaning 
that is associated not with a particular referent in the external 
world, but rather with a concept or mental representation, which 


itself is derived from categorisation of our experience in the 


movement have argued against both foundationalism and relativism, as 
for example in Johnson (1987, 194-212). He argues (202) that “we are 
in touch with our world but always in a mediated fashion. There is thus 
no single, God’s-Eye way of carving up the world. But it does not follow 
from this that we can carve it up any way we wish.” See also Harder 
(2007, esp. 1253). 


'S Moreover, linguistic utterances are meant to elicit a shared concep- 
tual structure between speakers. 
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external world. Notably, because Cognitive Linguistics hypothe- 
sises that linguistic cognition is indistinct from cognition in gen- 
eral (the first commitment), categorisation occurs not only with 
physical entities but also with language itself. As a result, linguis- 
tic phenomena are not strictly divided into the traditional ‘levels’ 
of phonology, morphology, syntax, and so on. Rather, language 
is viewed as a unified phenomenon for which these terms serve 
as convenient labels for what are in reality overlapping catego- 
ries. (Evans and Green 2006, 6-7, 28-30; Geeraerts and Cuyckens 
2007a, 5). 


2.1.4. Language is the Conventions of Use 


As noted above, Cognitive Linguistics shares with functionalism 
its focus on language as a means of communication and actual 
usage events among speakers. In Cognitive Linguistic theory, 
knowledge of language is understood to emerge from use, such 
that the abstraction of linguistic categories and structures by 
language learners and users occurs inductively. On this view, a 
language is nothing more than the set of form-meaning pairings 
used by convention in a speech community (Croft 2000, 26, 95- 
99).'* Those form-meaning pairings, moreover, occur at every 
level of language, from morpheme all the way up to syntax and 
even discourse. A corollary of this commitment is the unpre- 
dictability of language owing to variability in usage events over 


time, which leads to language change. Language change occurs 


Cognitive Linguistics tends to use the term ‘utterance’ for communica- 
tion usage events, which are understood as particular, actual instances 
of spoken or written forms by a language user (Evans 2007, 217-18). 
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not only because speakers themselves change, but because the 
external world does, too (Croft 1990, 257). At the same time, 
language change is not considered unpredictable. Although 
Cognitive Linguistics is not interested in uncovering linguistic 
universals in the same sense as in formalist theories, it does 
acknowledge universal tendencies in human language use that 
are constrained and motivated by shared cognitive abilities and 
processes, and therefore result in similar patterns of diachronic 
change across languages (Croft and Cruse 2004, 3-4, 71-73; 
Geeraerts 2006b, 4, 5-6).'° 


2.2. Six Concepts 


In addition to these four key theoretical commitments in Cogni- 
tive Linguistic theory, there are numerous concepts that flow 
from them. While there are more than space allows for here, the 
six concepts discussed below are widely considered central to 
Cognitive Linguistics as a framework for the study of language. 
In anticipation of the next section, each concept discussed here 


is illustrated with reference to the biblical text and languages. 


2.2.1. Image Schemas 


Since Cognitive Linguistics understands conceptual structure to 
be grounded in embodiment, semantic structure is reckoned the 
same way. That is what it means for language to be embodied. In 


other words, part of what makes language meaningful is the 


'S On grammaticalisation theory, see especially Narrog and Heine 
(2011), Hopper and Traugott (2012), and Kouteva et al. (2019). 


130 Ross 


human embodied experience with which it is associated.'° One of 
the foundational ways of illustrating and accounting for this 
conceptual association between embodied experience and lin- 
guistic meaning is what Cognitive Linguistics calls the image 
schema, originally developed by Johnson (1987). According to 
Sullivan (2017, 398), image schemas are “simple cognitive struc- 
tures that represent spatial configurations independently of a 
single sensory modality.” They arise directly from repeated sen- 
sory interactions with the world, including the visual, auditory, 
haptic (touch), and vestibular (balance) systems of the body. 
Image schemas are not detailed ideas, but are rather abstract or 
‘schematic’ in nature (hence the name), and thus provide a foun- 
dation for richer conceptual and semantic structures. In this 
sense, image schemas are pre-conceptual and subconscious.’” 

To label and describe image schemas, Cognitive Linguistics 
uses SMALL CAPITALS and simple diagrams, respectively. For exam- 
ple, because the human body has a unidirectional visual appa- 
ratus (i.e., eyes that look in one direction), axial orientation is 
inherently part of embodied experience. This simple reality of 


human embodiment gives rise to a number of image schemas, 


’6 Or, as Evans and Green (2006, 158-60) put it, “Semantic structure is 
conceptual structure,” and conceptual structure is embodied. 


17 In a series of studies, Mandler (1988; 1992; 1996; 2005; 2010) has 
shown how image schemas arise in conjunction with physical and psy- 
chological development during early childhood (even in the womb) 
through what she calls perceptual meaning analysis. See also Evans 
(2014, esp. 118-26; 2015, 122-53). 
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such as FRONT-BACK, LEFT-RIGHT, and—given the universal expe- 
rience of gravity and three-dimensional space—UP-DOWN and 
NEAR-FAR as well. These image schemas are interconnected in 
human visual experience, as are many others. For example, 
Figure 1 shows the CONTAINER image schema at the far left. The 
diagrams in the centre and at right involve the CONTAINER image 
schema as well, but do so as part of the related image schemas 


for IN and OUT, respectively.'® 


Figure 1: Image schemas for CONTAINER, IN, and OUT 


LM LM LM 


@TR @rTr 


Note too the use of ‘LM’ and ‘TR’ in Figure 1, which stand for 
‘Landmark’ and ‘Trajector’, respectively. These terms refer to 
elements that are related in any given construal, but are profiled 
in different ways as either focal (Trajector/TR) or non-focal 
(Landmark/LM). For example, in the centre diagram, the TR 
entity is IN the CONTAINER LM, while in the right diagram the TR 
is OUT (Evans and Green 2006, 176-91; Gibbs and Colston 2006; 
Evans 2007, 106-08; Oakley 2007).’° 


'8 Other image schemas that CONTAINER helps to structure could be 
elaborated, such as TOP-BOTTOM, OVER-UNDER, and FULL-EMPTY, etc. 


'9 There are numerous image schemas that Cognitive Linguistics has 


collectively identified, although these are not exclusive of others that 
may be proposed. See Evans (2019, 235-36) for a synthesised list. 
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The notion of an image schema may seem simple, but 
because it helps analyse conceptual structures it also has signifi- 
cant explanatory power for linguistic structures, as illustrated in 
the clauses in example (1) below. 

GQ) (a WyTA AIP RY 
‘Then Jonah went out of the city’ 
(b) Wy? DTPA aw 
‘and sat east of the city,’ 
(c) maDow id wy 
‘and he made a booth there for himself.’ 
(d) xa Ann aw 
‘And he sat under it in the shade’ (Jon. 4.5a) 
In (1a) Jonah is a TR that is portrayed as OUT of the boundaries 
of the city, which itself is construed as a CONTAINER LM. In addi- 
tion, the verbal event in (1a) is structured by the SOURCE-PATH 
image schema that involves linear motion. Similarly, Jonah’s 
resting place 077 ‘east’ of the city in (1b) involves a CENTRE- 
PERIPHERY image schema in which the city is construed as central 
and Jonah’s spatial position in (1b) as peripheral to the scene. 
(As discussed in more detail in §2.2.5 below, the geographical 
sense ‘east’ for 07p arises from a FRONT-BACK image schema 
involved in the semantic structure of the word.) In (1d), Jonah is 
again a TR, but now profiled against two LMs, one being the 
booth as an elevated SURFACE under which Jonah sits and the 
other being the shade produced by the booth as a CONTAINER in 


which Jonah is located.?° 


20 Note, too, that utterances involve perspective or situatedness in the 
spatial construal (Croft and Cruse 2004, 58-63). In (1) the construal is 
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2.2.2. Frame Semantics 


An important aspect of the commitment to embodied cognition 
in Cognitive Linguistic theory is the related thesis that meaning 
is encyclopedic in nature. That is, semantic structure (form/ 
meaning pairing) is inextricably linked—in fact it is understood 
to ‘grant access’-—to a complex inventory of structured know- 
ledge about the world. That knowledge is encyclopaedic in that 
it derives from both physical and sociocultural dimensions of 
human experience.” The theories of Frame Semantics and Con- 
ceptual Domains are two approaches in Cognitive Linguistics to 
extrapolate this understanding of linguistic meaning. These theo- 
ries are distinct, but in many ways complementary. This section 
focuses on Frame Semantics, leaving Conceptual Domains aside 
until the next section.” 

A semantic frame is a schematisation of experience that 
is represented conceptually and held in long-term memory. In 
essence, a frame is a knowledge structure of interrelated concepts 
associated with an identifiable, culturally-embedded scene in 
human experience. Frames contribute to meaning construction 
by virtue of their gestalt quality. That is, without knowledge of 


the relationship structure of the frame as a whole, knowledge of 


allocentric, taking a kind of bird’s-eye view of the scene, rather than a 
‘Jonah’s-eye view’. On the typological diversity of spatial models of 
construal, see for example Mawyer and Feinberg (2014). 


1 As van Wolde (2009, 51-103) puts it, words are “tips of encyclopedic 
icebergs.” 

2 Frame Semantics originated with Fillmore (1982; 1985), while Con- 
ceptual Domains were developed by Langacker (1987). 
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any given concept within it is incomplete. Fillmore (2006, 373) 
makes this same point, explaining that a frame is “any system of 
concepts related in such a way that to understand any one of 
them you have to understand the whole structure in which it fits; 
when one of the things in such a structure is introduced into a 
text, or into a conversation, all of the others are automatically 
made available.” For example, the concept associated with the 
word ‘Monday’ is linked to, prompts, and can only be fully under- 
stood within the frame WEEK, which is simpler and more basic 
than its parts.”* Although frames are basic modes of knowledge 
representation in this sense, they are not static. Rather, a frame 
is updated, modified, and adapted on the basis of ongoing experi- 
ence (see further Evans and Green 2006, 206-47; Ungerer and 
Schmid 2006, 207-18; Cienki 2007; Evans 2007, 85-86). 
Understanding frame semantics comes more easily by illus- 
tration. In CL, a semantic frame is denoted using SMALL CAPITALS, 
much like an image schema. So, for example, the BAKING frame 
includes categories for at least one participant, the BAKER, who is 
in the BAKING role working with elements like INGREDIENTS and 
TOOLS that themselves have properties like WET, DRY, SHARP, and 
HOT. In an ancient Greek context, the BAKING frame provides the 
background and motivation for categories associated with words 
like cewtdadrts (‘fine flour’), Guy (leaven’), dvaudcow (‘to knead’), 


dvpaua (‘dough’), xAiBavos (‘oven’), and dptos (‘bread loaf’). 


3 | am grateful to one of my peer reviewers for his helpful illustration. 
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An interesting example of the BAKING semantic frame at 


work appears in Matthew 16. The following account appears as 


Jesus and his disciples are travelling by boat: 


(2) 


(a) 


(b) 


(c) 


(d) 


Kal éA@dvtes of uadytai cic TO TEpav emeAdbovto dptous 
AaBeiv. 

‘And the disciples came to the other side and had 
forgotten to take bread.’ 

6 d& Tyoots eimev adtoic- dpate xal mpocéyete amd THs Couns 
THv Dapicaiwy xal Daddovxatwv. 

‘And Jesus said to them, “Watch out and beware of 
the leaven of the Pharisees and Sadducees.””’ 

ot d& dteAoyilovto év Eéautoig A€yovtes Ott Aptous ovx 
éhaBouev. 

‘And they began to discuss among themselves, saying, 
“It is because we took no bread.” 

yvovs 5& 6 Ingots eimev ti Stadoyifecbe ev éautoic, 
dAtyoTtaTol, OTL ApTous ox exETE; 

‘But Jesus, aware of this, said, “You men of little faith, 
why do you discuss among yourselves that you have 
no bread?”’ (Matt. 16.5-8) 


With the note about their lack of bread in (2a), the scene is set 
for Jesus’ comment in (2b). He cautions his disciples against 7 


Guy (‘the leaven’) of their religious opposition. In (2c) the dis- 


ciples, having no bread with them but hearing Jesus mention 


(oun, become confused. Understanding how words grant access 


to semantic frames that structure encyclopaedic knowledge helps 


account for that confusion, as the word @uy naturally prompts 


the BAKING frame, although that is not the right frame for under- 


standing Jesus’ warning. The preceding events involving bread 


and the disciples’ ongoing mental preoccupation with having 
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forgotten to bring any on their journey further explain their ini- 
tial (mis)interpretation in (2c) of Jesus’ words in (2b). But as seen 
in (2d), Jesus’ comment is in fact not straightforwardly about 
bread, but rather about teaching. The connections between those 
two ideas are extensive and rich, as are the implications, and they 
are explored in more detail in §2.2.4 below. Before doing that, 
however, another major concept within Cognitive Linguistics 
needs explanation.” 


2.2.3. Domains and Conceptual Metaphor 


One of the best-known parts of Cognitive Linguistics as a whole 
is conceptual metaphor theory, originally developed by Lakoff 
and Johnson (1980; see Tay 2014).”° In essence, the theory holds 
that metaphor in language is no mere stylistic or rhetorical fea- 
ture, but in fact helps to structure cognition and meaning itself 
via embodied experience. Conceptual metaphor involves two 
domains, a source and a target, the former being mapped or ‘pro- 
jected’ unidirectionally onto the latter such that richer or more 


complex meaning arises through correspondence. A domain is 


4 Further illustration of frame semantics for Hebrew appears in Ziegert 
(2021, 29-31) and de Blois (2004). See also the application of frame 
semantics to English available online at https://framenet.icsi.berkeley. 
edu, accessed 29 June 2023, which also offers a very useful glossary of 
terms. 


?° Cognitive Linguistics has also proposed a theory of metonymy, which 
is not outlined here for reasons of space, though some have proposed 
metonymy as an even more basic cognitive process than metaphor. See 
Croft (2006); Evans and Green (2006, 310-27); Polzenhagen et al. 
(2014). 
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similar to a semantic frame, as discussed above, but is not neces- 
sarily associated with experience of a particular scene. Rather, 
domains are cognitive entities of varying levels of complexity and 
organisation that provide background information against which 
a concept is understood. Typically, concepts are structured with 
multiple domains in a kind of matrix. 

Some domains are basic, deriving directly from embodied 
experience, and are thus pre-conceptual. As such, basic domains 
are similar to image schemas, but the latter are built upon the 
former, which are also not necessarily imagistic. Examples of 
basic domains, which again are denoted in Cognitive Linguistics 
using SMALL CAPITALS, would include SPACE, TEMPERATURE, TIME, 
VOLUME, and COLOUR, among others (see further in Evans and 
Green 2006, 234-35). These domains often provide the source in 
pervasive conceptual metaphors that structure linguistic mean- 
ing, but more complex domains may also appear. What makes 
these conceptual and not merely rhetorical metaphors is their 
motivation and usefulness at the level of thought itself. Several 
examples below will illustrate this theory (Evans and Green 
2006, 230-47, 286-310; Ungerer and Schmid 2006, 114-27; 
Evans 2007, 33-35, 61-62; Grady 2007). 

The first example demonstrates the pervasiveness of certain 
conceptual metaphors across languages and cultures owing to 
motivation by common human physical experience, as shown in 
example (3). 

(3) (a) ap D AAP RATMD pANA awd ID 
‘Let all the inhabitants of the land tremble, for the day 
of the LORD is coming; it is near’ (Joel 2.1) 
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(b) Jayna Siang Oba PVA DW PK D 

‘For a thousand years in your sight are like yesterday 

when it is past’ (Ps. 90.4) 
One of the most pervasive conceptual metaphors is to understand 
TIME in terms of either SPACE or MOTION. In (3a) we see the FUTURE 
EVENTS ARE AHEAD metaphor, which arises from our experience of 
looking in the direction in which we are physically moving, so 
that as entities get closer to us they become visually larger. 
Physical movement through space also involves temporal pro- 
gression, such that arrival at a distant destination corresponds to 
a future point in time. Along these same lines, a converse meta- 
phorical entailment appears in (3b), where PAST EVENTS ARE 
BEHIND. 

The second set of examples is more specific to the con- 
ceptual environment of ancient Israelite prophetic literature, in 
which the relationship between God and his people is portrayed 
as a marriage. This idea appears vividly in Ezekiel 16, where in 
verse 8 God says: “I also swore to you and entered into a covenant 
with you so that you became mine.” In that chapter and others 
throughout the prophetical books there are statements like the 
following: 

(4) (a) Jawovonn) 7a Noam 
‘But you trusted in your beauty and became a whore 
because of your fame’ (Ezek. 16.15) 
(b) mat ab’ my tn pea nnn) n7ag AYE" oy 72 
‘For on every high hill and under every green tree you 
have lain down as a whore’ (Jer. 2.20b) 


At the foundation of verses like these is the conceptual meta- 
phor COVENANT IS MARRIAGE. However, a number of derivative 
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metaphors also emerge as a result of the structure of MARRIAGE— 
at least as it was understood in the ancient Israel—as a concep- 
tual domain. For example, in (4a) God’s people are condemned 
in the broader contexts of the sentences in (4) on the basis of the 
metaphor IDOLATRY IS ADULTERY. The same metaphor appears in 
(4b), but with the additional implication that HIGH PLACES ARE 
SITES OF SEXUAL LIAISON. Of course, both examples in (4) are linked 
to the broader conceptual metaphor WORSHIP IS SEX.”° 


2.2.4. Mental Spaces and Conceptual Blending 


Conceptual blending is a theory initially posited by Fauconnier 
(1994) and further developed by Fauconnier and Turner (2002), 
who argue that meaning is constructed in larger units of language 
(i.e., the sentence level and above) by integrating knowledge 
structures in novel and creative ways that give rise to a ‘blend’. 
Conceptual blending is a basic, effortless cognitive process in 
human thought and imagination that is prompted directly in the 
dynamic context of communication. Again, Cognitive Linguistic 
theory hypothesises that language grants access to encyclopaedic 
knowledge of the external world as a kind of prompt for con- 
ceptualisation. This approach entails that meaning construction 
is grounded in language use, such that there is no principled divi- 
sion between semantics and pragmatics, as in formal approaches 
(Turner 1991, 206; Birdsell 2014).?’ 


7° Further illustration of conceptual metaphor theory for Hebrew 
appears in Ziegert (2021, 31-33). 

?? That is, they are not absolutely distinct. Cognitive Linguistic theory 
places semantics and pragmatics on a continuum of form-meaning 
pairings that may move from the pragmatic pole to the semantic pole 
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Meaning construction through conceptual blending involves 
the integration of mental spaces. Fauconnier (2007, 351) defines 
mental spaces as “very partial assemblies constructed as we think 
and talk for purposes of local understanding and action. They 
contain elements and are structured by frames and cognitive 
models... [and] are connected to each other by various kinds of 
mappings, in particular identity and analogy mappings.” As 
Evans and Green (2006, 369) put it, “you can think of a mental 
space as a ‘thought bubble’.” Mental space theory is distinct from 
conceptual blending theory, but the two are closely related and 
function in a similar way to conceptual metaphor, though with 
important differences. Whereas conceptual metaphor involves 
unidirectional mapping of domains, conceptual blending involves 
selective integration of mental spaces into a novel elaboration. 
Whereas conceptual metaphors are stable and widely shared 
knowledge structures held in long-term memory, conceptual 
blends may be temporary and unique conceptualisations of 
information for creative purposes specific to ongoing discourse 
(although even blends may become conventionalised in long- 
term habitual cognitive structures). 

Consider the example of conceptual blending in (5) below. 
This text is part of the same passage as example (2) above, and 
describes the resolution to the disciples’ misconstrual of Jesus’ 


warning. 


as they become conventionalised in the language over time through 
entrenchment. On pragmatics within Cognitive Linguistics, see Panther 
(2022). 
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(5) (a)  méic od voeite St od mEpl dptwv eimov buiv; mpocexete OE amd 
TIS Couns THY Bapicalwy xal Daddovxatwv. 
“How do you not understand that I did not speak to 
you about [actual] bread? But beware of the leaven 
of the Pharisees and Sadducees.”’ 
(b) tote cuviixav ot ovx elmev mpocéyery and tHe Guys 
TOv &ptwv aA’ and Tio days tTHv Dapicatwy xal 
Laddovuxatwy. 
‘Then they understood that He did not say to beware 
of the leaven of [actual] bread, but of the teaching of 
the Pharisees and Sadducees.’ (Matt. 16.12-13) 
Figure 2 presents a representation of the conceptual integration 
network involved in this passage, which could no doubt be fur- 
ther elaborated. 
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Figure 2: Conceptual integration network in Matt. 16.5-13 


Generic Space 
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EXPANSION \- —e 


“A TEAGHER Is Kenner 
~ ASYNAGOGUE ISA KITCHEN 
- LISTENERS ARE DOUGH 
- etc. 


At the left and right are two mental spaces (called ‘input spaces’) 
represented by circles, each containing properties, roles, and 
relations and each structured by a semantic frame. The input 
spaces share features that are explicated in the generic space at 
the top of the figure. Through conceptually projecting and 
integrating input features, a novel conceptualisation emerges, 
represented at the bottom in the ‘blended space’. Note that this 
mapping is selective; not all properties and elements in the input 


spaces are necessarily involved in the blend (Fauconnier 1994; 
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1997; Fauconnier and Turner 1998; Evans and Green 2006, 363- 
444; Evans 2007, 12-13, 114-15; Fauconnier 2007). 

As noted above in §2.2.2, at first the disciples misunder- 
stand because they are conceptualising Jesus’ comments within 
the BAKING semantic frame alone. But after Jesus reminds them 
that obtaining actual bread for food is not the real problem (Matt. 
16.9-10), they reconceptualise his words in terms of the blend 
portrayed in Figure 2. Jesus’ clarification in (5a) prompts 
dynamic and temporary meaning construction in the context of 
their communication. As the features of each input space are 
integrated in the conceptual blend, novel meaning construction 
occurs. That meaning is represented in a limited way in the 
blended space, where the implications can be conceptually elabo- 
rated. For example, in the blend, the PHARISEES AND SADDUCEES 
(or perhaps just TEACHERS) ARE BAKERS, the SYNAGOGUE IS A 
KITCHEN, and anyone there as A LISTENER IS DOUGH. As Jesus 
explains, TEACHING IS LEAVEN, which has a disproportionate and 
determinative effect upon the outcome of DOUGH, for better or 
worse, when it is baked. In this sense, then, a DISCIPLE IS A LOAF 
OF BREAD. In a context where bread was a major part of daily diet 
and local bakers and bread quality would have been well known, 
this conceptual blend would have had readily accessible explana- 


tory power. 


2.2.5. Prototypes and Semantic Extension 


The cognitive approach to lexical semantics understands words 
as lexical items whose meanings are associated with a complex 


but structured conceptual category (or categories). This view of 
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categorisation was originally posited by Rosch (1978) and 
developed by Lakoff (1987), Taylor (2003), and others. Cognitive 
Linguistic theory hypothesises that conceptual categories form 
because humans gather as much information about our environ- 
ment as possible with the least possible cognitive effort. The cate- 
gorisation process also occurs because humans perceive con- 
sistent correlation between features of the external world. The 
principle of economy gives rise to differing levels of inclusiveness 
for categories and their members, while the principle of correla- 
tion informs the organisation of category members around a 
central exemplar, which is called a prototype. As such, the 
categories are radial, organised around the prototype to include 
other, gradually more peripheral members that are distinct but 
related. Prototype theory applies to lexical semantics insofar as 
any given word also forms a category—held in the ‘mental 
lexicon’—with a prototypical meaning (or sense) at its centre and 
with other meanings extending from the prototype in a semantic 
network.*® This model of lexical semantics integrates other 
aspects of Cognitive Linguistic theory, especially image schemas, 


semantic frames, and conceptual metaphor theory. It is primarily 


28 Note that prototype theory is applicable to both onomasiology and 
semasiology. The former deals with how words are used to categorise 
(or name) objects in the external world, whereas the latter deals with 
the network of concepts (or meanings) of a word understood as a 
category itself. Cognitive lexical semantics deals with both, but the 
discussion below is semasiological. The term mental lexicon refers to 
the inventory of words known by a language speaker, which is 
organised and detailed but nevertheless latent knowledge. See further 
Aitchison (2012); Taylor (2012). 
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by these cognitive mechanisms that Cognitive Linguistics has 
accounted for word meaning as a semantic network in which less 
prototypical senses derive from more prototypical senses through 
motivated (though not always predictable) meaning extension 
(Taylor 2003, 41-83; Evans and Green 2006, 328-63, 445-67; 
Geeraerts 2006c; Evans 2007, 175, 176-77; Lewandowska- 
Tomaszczyk 2007; Geeraerts 2010, 182-272; 2015). 

To illustrate prototype theory, cognitive semantics, and 
embodied cognition, we will revisit 01> from example (1b), 
above. This lexical item is a compound of the preposition jn 
(‘from’) and o7p, the latter of which will come into focus here 
first. The prototypical sense of this nominal appears to be front 
as an embodied, spatial concept. This sense appears in some texts, 
as in the adverbial uses in (6) below: 

(6) (a) camy DT TINK 
‘You encircle me in back and in front’ (Ps. 139.5) 
(b) 1) pageo} TIN) UPR] PAS DTD 1 


‘Look, I go forwards and he is nowhere; 

backwards, but I do not sense him’ (Job 23.8) 
In the HB, the spatial concept front is more often expressed using 
prepositional constructions involving the word 7135 (the human 
‘face’), such as 07199 (‘before’, ‘facing’) or »18(-5y) (‘Lat the] front 
of’). Even so, the front meaning associated with both 715 and 07) 
arose from the embodied construal of the human face as the axial 


front of a person given the orientation of visual perception.” 


2° The same construal underlies the verb 07p ‘to meet, confront’, which 
likely derived from the nominal. 
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Two other senses of 077? extend from the spatial prototype, 
each of which is motivated by different metaphorical construals 
of SPACE and TIME as semantic domains. Owing to embodied expe- 
rience and encyclopaedic knowledge, the period of time catego- 
rised as a DAY is associated with the light of the sUN, which—as 
depicted with the arrow labelled A in Figure 3—follows a con- 
sistent directional trajectory from the temporal BEGINNING of the 


period in the east to its END in the west.°° 


Figure 3: Semantic extension of 07) 


. A 
va? 
al > 
g& V aN 
B 
END/BACK/WEST BEGINNING/FRONT/EAST 


°° Although space constraints prohibit fuller substantiation of this lexi- 
cal semantic proposal, it is noteworthy that the polysemy of the Hebrew 
word 7° supports it, as the word can mean both right (side) or south. 
The former is the spatial prototype; the latter is a metonymic extension 
that can only be motivated within an eastward-facing construal, as I am 
proposing for otp. It seems that eastwardness was construed as the 
unmarked/default directionality (e.g., Zebulun’s boundary runs nn7p 
nmin, ‘forwards, towards sunrise’ to Gath-hepher in Josh. 19.13). In 
addition, eastward orientation was significant in other aspects of 
Israelite culture, as in the geographical orientation of the entrance to 
the Tabernacle and later Solomon’s Temple (like many other ancient 
Near Eastern religious structures) towards the east (see Exod. 26.18-22; 
1 Kgs 7; cf. Gen. 3.24 below). 
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Those earliest and latest temporal periods of the DAY period may 
be metaphorically construed as its spatial FRONT and BACK through 
the cognitively routine conceptual metaphor TIME IS SPACE, or, 
more specifically in this instance, THE BEGINNING IS THE FRONT. The 
texts in (7) illustrate this sense. 
(7) (a) tRQ Popp DTP ITT PwRI AP MIT 

‘The LORD possessed me at the beginning of his way, 

before his deeds of old’ (Prov. 8.22) 

(b)  yianvae? int oT PD P32 

‘Their children will be like before, and their congrega- 

tion will be established in my presence’ (Jer. 30.20a) 
Given the movement of the SUN across the sky during the DAY 
period, this metaphor entails gradedness, such that EARLIER IS 
MORE FRONTWARD and vice-versa. It is in this way that the proto- 
typical spatial meaning of 07) ‘front’ can extend metaphorically 
to the temporal sense ‘before’. 

A second semantic extension occurs, however, when the 
concept of directionality is added (with the affixed preposition 
19) to the spatial construal of a DAY, as depicted with the arrow 
labelled B in Figure 3. The examples in (8) demonstrate this 
meaning, as does (1b) above. 

(8) (a) DTITNN YU? DTPA UA OTR WIM 
‘So he drove out the man and positioned cherubim 
east of the garden of Eden’ (Gen. 3.24) 
(b) p> oTpA Wx AAA-Y Thay PyA Tin Oya min Tiad Sy 
‘And the glory of the LORD went up from the middle 
of the city and stood over the mountain that is east of 
the city’ (Ezek. 11.23) 


148 Ross 


This conceptualisation of 07) is also graded but involves geo- 
graphical positionality of an entity relative to the SUN at the 
metaphorical FRONT of the DAY (0777/). In this way, the third sense 
‘east’ is motivated by the conceptual metaphor EASTWARDNESS IS 
PROXIMITY TO SUNRISE.*? 


2.2.6. Cognitive Approaches to Grammar 


The study of grammar was at the centre of the emergence of 
Cognitive Linguistics out of generativism in the 1970s and 1980s. 
Since that time, two broad approaches to grammar have appeared 
as trunks of a single Cognitive Linguistic tree; distinct but 
grounded in the same spot. On the one hand there is Cognitive 
Grammar, a broad theoretical framework developed by Langacker 
(1987; 1991) that is to date the most influential (see the over- 
views in Langacker 2007 and Bennett 2014). It is also very 
detailed and expansive, to the extent that the introduction by 
Taylor (2002) covers topics ranging from phonology and mor- 


phology to verbal tense and idioms. On the other hand, there is 


31 The use of jn (‘from’) with o7p in this sense indicates a different 
construal of focal and non-focal entities as compared with English 
directional expressions. When correlating entities with cardinal direc- 
tions, native English speakers construe eastward positionality as com- 
pleted movement from the focal entity" to the non-focal entity (e.g., 
Jonah” was [positioned to the] east of the city”). Hebrew speakers, 
however, appear to have had EAST as a directionally-stable third concept 
in the construal, such that eastward positionality of the non-focal 
entity was construed as completed movement from the EAsT® towards 
the focal entity” (e.g., Jonah™ was [positioned from the] east® of the 


city’) : 
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Construction Grammar, the roots of which appear in Kay and 
Fillmore (1999) and Goldberg (1995), and which was later devel- 
oped by Croft (2001) and others.*” 

It is important to note that the six key concepts of Cognitive 
Linguistics discussed so far have been focused on semantics, spe- 
cifically how meaning emerges in linguistic structures from con- 
ceptual structures. Cognitive approaches to grammar use these 
same key concepts to focus more directly on the linguistic system 
itself. In doing so, there are certainly differences between Cog- 
nitive and Construction Grammars, but they nevertheless share 
two guiding principles and are therefore compatible (see further 
Broccias 2006). The first is the symbolic thesis, which holds that 
the basic unit of grammar is a form-meaning pairing as a linguis- 
tic unit. In contrast to formalist approaches, this thesis entails 
that grammatical structure is not treated separately from mean- 
ing. Cognitive approaches to grammar take all form-meaning 
pairings into consideration as a unified and structured inventory 
of conventional linguistic units, from the level of bound mor- 
phemes, to lexical items, to syntactic configurations, understand- 
ing these units as existing along a continuum. The second guiding 
principle is the usage-based thesis, discussed above in §2.1.4. 
Within the realm of grammar, this thesis entails that each lan- 
guage user develops a kind of mental grammar through experi- 


ence, with no sharp division between knowledge of a language 


32 See the overviews in Croft (2007) and Ramonda (2014). See also 
Hoffmann and Trousdale (2013) and the recent work by Hoffmann 
(2022). The earliest Construction Grammar proposal was Fillmore, Kay, 
and O’Connor (1988). 
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and use of a language (Croft and Cruse 2004, 225-90; Evans and 
Green 2006, 475-511; Ungerer and Schmid 2006, 244-56). 

This section will focus only briefly on two examples to 
illustrate Construction Grammar in particular, building on 82.2.2 
above. Construction Grammar helps to account for both ‘irregu- 
lar’ idiomatic expressions as well as ‘regular’ syntactic expres- 
sions as linguistic units called constructions. While the former are 
not discussed here for reasons of space, the example in (9) below 
helps illustrate the latter in terms of the argument structure of 
constructions at the sentence level. 

(9) Xmrayyviabels dE 6 xvptog TOD dovAoU Exeivou améduceEV adtov xal 

TO OdvElov adyxev aves. 

‘Then out of pity that servant’s master released him and 

forgave him the debt.’ (Matt. 18.27) 

This use of apiynut as a ditransitive verb meaning forgive is com- 
mon in the New Testament (cf. Matt. 6.12; 12.31), involving a 
syntactic construction that we will call Forgive Y Z and that can 
be represented as X RESOLVES Y FOR Z (or, even more simply, 
CAUSE-RECEIVE with the Y resolution in view). In this construction, 
X is the AGENT, Y the PATIENT, and Z the BENEFICIARY (SVO“0'"), 
with each argument expressed in the nominative, accusative, and 
dative cases, respectively.* 

Goldberg’s Construction Grammar approach to verb argu- 
ment structure can also help account for sense distinctions by 
virtue of semantic frames. For example, the Forgive Y Z Construc- 
tion in (9) involves the FINANCIAL TRANSACTION frame in which 


°° For further discussion of this construction and how it can be repre- 
sented at semantic and syntactic levels, see Goldberg (2006, 20-22). 
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the conceptual metaphor SIN IS DEBT TO GOD appears. But the verb 
adinut also has a permit sense, which still has three arguments but 
involves a different construction and semantic frame, as shown 
in (10). 
(10) xai odx Atev Aart TA Jatwdvia, bt1 Hdetcav avdtov. 
‘And he would not permit the demons to speak, since they 
knew him.’ (Mark 1.34b; cf. Matt. 8.22; Luke 8.51; Rev. 
11.9) 
The construction here might be called Allow Y to Z and can be 
represented as X ALLOWS Y TO PERFORM Z ACTION. This is a varia- 
tion of the CAUSE-RECEIVE construction, where X is still the AGENT 
and Y the PATIENT, but Z is now another verb in the infinitive 
(SVOD, which will involve its own construction. The semantic 
frame varies depending upon what fills the Y and Z roles, but in 


many cases it is an AUTHORITY or CONTROL frame. 


3.0. Cognitive Linguistics in Biblical Studies 


Because Cognitive Linguistic theory itself is not a single set of 
clearly defined procedures and approaches, as noted above, it is 
no surprise that the use of Cognitive Linguistics within biblical 
scholarship is similarly variegated (Howe and Sweetser 2013, 
122). The earliest application of Cognitive Linguistics to the 
study of the Bible and its languages was the use of conceptual 
metaphor theory by Brettler (1989), a substantial revision of his 
doctoral dissertation. ‘Use’ may be too strong a word, however, 
as the interaction is limited to five total citations of the work of 
Lakoff and Johnson (1980), all of which appear in the introduc- 


tory chapter on method. The word ‘cognitive’ does not appear 
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anywhere in Brettler’s work and even ‘conceptual metaphor’ 
occurs only once (23). 

Still, Brettler’s study brought Cognitive Linguistics in its 
early phase to the attention of biblical scholarship, particularly 
in the study of Hebrew. Green and Howe (2014, 1) call this a 
“first wave,” which was followed in the 1990s and early 2000s 
by anumber of journal articles and conference papers that mostly 
applied Cognitive Linguistic theory to the Hebrew Bible. The 
“second wave” of influence they identify as the formation of ‘The 
Use of Cognitive Linguistics in Biblical Interpretation’ consulta- 
tion at the 2006 annual meeting of the Society of Biblical Lit- 
erature, under the leadership of Mary Therese DesCamp, Joel B. 
Green, Bonnie Howe, and Eve Sweetser. Howe and Sweetser 
(2013, 124-25) give a useful overview of the first six years of 
activity in this group, which ultimately culminated in the pub- 
lication of a volume of collected essays that nicely balances 
Hebrew and Greek studies (Howe and Green 2014).*4 

Biblical scholars have of course continued to employ Cogni- 
tive Linguistic theory in their work to great effect, both indepen- 
dently and in connection with the SBL annual meeting. The fol- 
lowing discussion highlights contributions in each of the respec- 


tive biblical languages. Because Howe and Sweetser (2013, 125- 


34 It is my honour to serve presently as amember of the steering commit- 
tee of the current iteration of this same group, now known as the Cogni- 
tive Linguistics in Biblical Interpretation programme unit. At present, 
members of the committee are working towards an edited volume intro- 
ducing Cognitive Linguistics for biblical scholars that is to be published 
with SBL Press. 
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27) provide a useful survey of Cognitive Linguistics in bibli- 
cal studies up through 2012, this section focuses on work that 
has appeared in the ten years since then, but is by no means 
exhaustive. 

As may be evident already in this section, it is true that 
most of the application of Cognitive Linguistics in biblical schol- 
arship has so far gone to Hebrew. By far the best introduction to 
and overview of this Hebrew scholarship to date is van der 
Merwe (2021). In a bibliography of just over one hundred publi- 
cations dealing with Cognitive Linguistics in biblical studies in 
the last ten years (which is nevertheless surely not exhaustive), 
almost sixty percent of the bibliography relates to Hebrew, with 
just under twenty percent to Greek. Overall, the great majority 
of publications are essay-length, appearing in either journals or 
edited volumes. Despite their narrower scope, these contributions 
do not merely address finer linguistic matters. For example, Ross 
(2019) considers how attention to the conceptual blends con- 
structed in Ps. 51 offers a different and perhaps better under- 
standing of the final verses than has otherwise been considered. 
Many interpreters regard vv. 18-19 (Hebrew vv. 20-21) as a later 
interpolation, assuming that the plea that the LORD would “build 
up the walls of Jerusalem” is an abrupt change of topic that must 
have arisen in a postexilic context. But these verses in fact 
interact and cohere with the entire psalm to prompt a conceptual 
blend in which David himself is Zion/Jerusalem whose damaged 


spiritual walls require restoration by God the builder. This 


3° Key works from these earlier years would include, for example, 
Danove (2001) and van Wolde (2009), among others. 
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application of Cognitive Linguistics thus goes beyond language 
itself to help address interpretive debates and even compositional 
history. 

In the scope of publications, far fewer in number are mono- 
graphs that apply Cognitive Linguistics to the biblical languages. 
When these do appear, such studies tend to be the published form 
of doctoral dissertations (a notable exception here is van Wolde 
2009). As such, these works can be extremely helpful on a 
broader topic, but of course still remain limited by default in 
what they address. A good example here is Robar (2015), who 
employs Cognitive Linguistics to address the function of the 
Hebrew wayyiqtol form at a discourse level to indicate schematic 
continuity (see also Robar 2021). Biblical scholarship has also 
begun to see Cognitive Linguistics applied in part or whole in 
collaborative edited volumes. For example, Ross and Runge 
(2022) present a collection of essays focused on understanding 
the semantics of postclassical Greek prepositions in Cognitive 
Linguistic perspective, particularly using prototype theory, point- 
ing to new possibilities in lexicography and drawing out interpre- 
tative implications. Similarly, the volume edited by Garcia Urenia 
et al. (2022) applies cognitive semantic theory to the lexicogra- 
phy of colour terms related to green within the Hebrew, Greek, 
and Latin Scriptures, analysing meaning at both the lexical and 
symbolic or cultural levels. 

Rarest of all at this point are large-scale works that employ 
Cognitive Linguistics in the more standard or traditional categories 
of biblical studies publications. While there are some grammars, 


for example, that do reflect a much more up-to-date linguistic 
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framework, such as van der Merwe, Naudé, and Kroeze (2017), 
to date none attempt to apply Cognitive Linguistics systemati- 
cally. It is also fair to say that syntax has received virtually no 
attention within biblical scholarship from a Cognitive Linguistic 
perspective, despite the remarkable explanatory power of Con- 
struction Grammar for phenomena that so far have been exam- 
ined only within a generative framework. Lexical semantics has 
fared better than syntax in biblical studies, but still lexicography 
proper has seen comparatively few results in print. However, 
following calls by van der Merwe (2006, 88-89) for attention to 
encyclopedic information in lexical entries, Reinier de Blois has 
been at work editing the Semantic Dictionary of Biblical Hebrew. 
Although the project is ongoing, its approach is based firmly in 
Cognitive Linguistic theory and the initial results are highly 


promising.*© 


4.0. Prospects for Research and Application 


This essay has only begun to outline the theory and potential of 
Cognitive Linguistics, which is now widely recognised as one of 
the major linguistic frameworks, one that continues to grow in 
popularity and application in numerous venues around the 
world. One of those venues has certainly been biblical studies. 
Yet despite the fact that Cognitive Linguistics has been present 
within biblical scholarship for thirty years, its effects and influ- 
ence are far from pervasive, for several reasons. One simple rea- 


son is that, while much of the activity in biblical studies involves 


°° The dictionary is freely available online at  https:// 
semanticdictionary.org/, accessed 4 May 2023. 
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the biblical languages, only a small proportion of scholarship 
focuses directly on refining contemporary understanding of the 
biblical languages themselves.*” Another reason, alluded to at the 
outset, is that even scholars who engage regularly with the lan- 
guages tend not to be acquainted with linguistic theory per se.** 
Knowledge of the differences between structuralism and genera- 
tivism, for example, is nowhere within the expertise of many, 
perhaps most, biblical scholars. Given the scope and complexity 
of linguistics itself, that is rather unsurprising. But the practical 
effect is to leave biblical scholars in the dark as to what sort of 
theoretical framework underpins their favourite biblical lan- 
guage tool and what that might entail, if they even understand 
that there are indeed entailments. It is precisely this situation in 
the discipline that this volume seeks to remedy, at least in part 
(to mix several metaphors) by offering some teaser trailers, land- 
marks for orientation, and goods to test out. 

In the end, however, biblical scholars must learn a hard 
lesson: If we truly wish to understand Cognitive Linguistics as a 
theoretical framework and apply it to better understand the 


ancient languages, it is directly to the primary literature itself 


37 One might add that there is a sizeable portion of biblical scholarship 
that gives little to no attention to the biblical languages at all. Profi- 
ciency in the biblical languages seems ever more to be a specialisation 
unto itself. 

38 There are exceptions that prove this general rule, notably the volume 
by Hornkohl and Khan (2021), which brings together specialists in 
Biblical and Rabbinic Hebrew and theoretical linguists. 


Cognitive Linguistics 157 


that we must go. Although some may worry they have no busi- 
ness or no hope in so doing, it is worth noting that the abundant 
proliferation of introductions, handbooks, and companions over 
the last decade or so has not been limited to areas of biblical 
scholarship. Happily, the same phenomena characterise other 
disciplines as well, including linguistics. On a very simple level, 
then, one of the most promising prospects for Cognitive Lin- 
guistics in biblical studies is for biblical scholars to take up and 
read such resources as are listed in the section below. Under- 
standing and applying Cognitive Linguistic theory is much more 
easily within reach than might be expected. For those already 
acquainted with Cognitive Linguistics—or at least those on their 
way—the prospects for research in the biblical languages and 


application in interpretation of Scripture are virtually limitless. 


5.0. Further Reading 


See the annotated bibliography in Howe and Sweetser (2013, 


129-31). Note also the following resources: 


5.1. Handbooks, Companions, Glossaries 


Dancygier (2017) 

Dabrowska and Divjak (2015) 
Taylor and Littlemore (2014) 
Evans (2007) 

Geeraerts and Cuyckens (2007b) 


Shr geese ee 


5.2. General Introductions 


1. Croft and Cruse (2004) 
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Dirven and Verspoor (2004) 

Evans and Green (2006), now updated by Evans (2019) 
Geeraerts (2006a) 

Ungerer and Schmid (2006) 


ak Ww DN 


5.3. Foundational Texts 


Fauconnier (1994) 
Fauconnier and Turner (2002) 
Johnson (1987) 

Langacker (1987; 1991) 
Lakoff (1987) 

Lakoff and Johnson (1980) 
Talmy (1988) 
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HISTORICAL LINGUISTICS AND THE 
BIBLICAL LANGUAGES 


Kaspars Ozolins 


The student of Biblical Hebrew can observe with relative ease 
that the language of the Old Testament not only differs from 
genre to genre and author to author, but also that it exhibits 
variation across its diachronically diverse writings.’ Likewise, the 
classicist engaging with the Greek of the New Testament will note 
that it differs in important ways from the Greek of earlier time 
periods. Language change is a fact of life, and the biblical text is 
certainly no exception to this. Yet the rigorous application of the 
findings and methodology of historical linguistics to biblical 
studies has been less prevalent and less thorough than might be 
expected. This chapter provides a broad introduction to historical 
linguistics—the study of language change—especially as applied 


to biblical studies. 


' This point, like almost anything in biblical scholarship, is disputed. 
Yet even those who would argue that the Old Testament is entirely the 
product of a single era still claim that certain authors employed an 
archaising style that reflects a diachronically earlier state of the lan- 
guage. See, e.g., Young, Rezetko, and Ehrensvard (2008). 
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1.0. The History of Historical Linguistics 


Although it has important historical precursors, modern linguis- 
tics developed in the wake of the Scientific Revolution of the 
sixteenth and seventeenth centuries in Europe. Historical linguis- 
tics in particular plays a special role in the historical development 
of linguistic inquiry, since the first systematic investigations into 
the nature of language were undertaken by scholars specifically 
interested in language change and the relationships among 
languages.’ 

Initial linguistic investigation, especially in the nineteenth 
century, was focused on the classification and comparison of the 
older languages of Europe and southwest Asia (the language 
family that came to be known as Indo-European). The growing 
impetus for these investigations occurred towards the end of the 
eighteenth century, as Europeans gradually became familiar with 
the languages and cultures of ancient India. In this connection, a 
famous statement from a speech delivered by Sir William Jones 
to the Asiatic Society in 1786 has been frequently quoted 
(Fortson 2010, 9): 

The Sanscrit language, whatever be its antiquity, is of a 


wonderful structure; more perfect than the Greek, more 
copious than the Latin, and more exquisitely refined than 


? There are also other important terms that are used to describe the 
field. One is comparative linguistics, which emphasises the comparative 
method as a means for reconstructing linguistic history (see §2.2.3). 
Another is diachronic linguistics, a term that is perhaps less ambiguous 
than historical linguistics, since the latter could possibly be interpreted 
as the study of the history of the field of linguistics in toto. 
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either, yet bearing to both of them a stronger affinity, both 

in the roots of verbs and the forms of grammar, than could 

possibly have been produced by accident; so strong indeed, 

that no philologer could examine them all three, without 

believing them to have sprung from some common source, 

which, perhaps, no longer exists; there is a similar reason, 

though not quite so forcible, for supposing that both the 

Gothick and the Celtick, though blended with a very differ- 

ent idiom, had the same origin with the Sanscrit; and the 

old Persian might be added to the same family. 
Jones’ somewhat hyperbolic and subjective descriptions of these 
languages might strike the modern reader as being rather quaint. 
Nevertheless, a fundamental axiom of historical linguistics posits 
that descendent languages are related to one another via a now- 
lost parent language. Even geographically and chronologically 
diverse languages can be unexpectedly related. Table 1 compares 
cognates in English, ancient Greek, Latvian (a Baltic language 
spoken in Northern Europe), classical Latin, and Hittite (an 


extinct language originally spoken in present-day Turkey). 


Table 1: Cognates in select Indo-European languages 


English Greek Latvian Latin Hittite 
water bdwp udens Umbr. utur | wadt-ar, -en- 

brother bpatyp brdlis frater negnas 
three TPES tris trés teres 
cow Botic govs bos GUD 

(D) am elut esmu sum esmi 
night vg nakts nox nekuz 


These languages exhibit similar basic vocabulary even though 
they range from the second millennium BC to the present day 
and from Turkey (Hittite) to Northern Europe (Latvian). The 


close resemblances demonstrate a genetic link between these 
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geographically diverse languages (two modern, three ancient) 
and rule out accidental resemblance or borrowing. Following 
Jones, nineteenth-century scholars were preoccupied with the 
relationships of these similar languages and they endeavoured to 
reconstruct their parent language, which came to be called Proto- 
Indo-European. 

However, after the appearance of Ferdinand de Saussure’s 
classic Cours de linguistique générale in 1916, the diachronic 
dimension of language study became more of a minor player in 
linguistics and was replaced by a primary focus in the twentieth 
century on the synchronic study of language.? Such a neat bifur- 
cation is of course an oversimplification, for diachrony and syn- 
chrony are always in relation with one another in language. In 
fact, a renewed appreciation for the centrality of diachrony in 
linguistics has grown in recent decades (see, for example, Bybee 
2010; Hartmann 2021). 

An overview of the history of the discipline will ideally 
focus on the study of the biblical languages, which are naturally 
of primary interest to biblical scholars. The languages of the Old 
and New Testaments (Hebrew, Aramaic, and Greek) belong to 
two linguistic families: Semitic and Indo-European. As such, it is 
appropriate to approach the field of historical linguistics from the 
perspective of these two language families. As it happens, most 
work in historical linguistics has thus far been done within these 


same two language families. 


3 Ironically, some of Saussure’s greatest early work was in the area of 
historical linguistics (see §1.1. below). 
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1.1. Indo-European Studies 


Koine Greek, the language of the New Testament era, belongs to 
the Greek (or Hellenic) branch of the Indo-European family of 
languages. The designation Koine (xotwy meaning ‘common’) 
refers to the lingua franca status of the Attic-Ionic dialect that 
spread throughout the Greek empire, launched by the conquests 
of Alexander the Great. Two earlier major periods in the written 
history of Greek may be identified: Classical/Archaic Greek and 
Mycenaean Greek (late second millennium BCE). The latter is of 
interest to Indo-Europeanists and other scholars because of its 
early attestation and its peculiar writing system, deciphered in 
the 1950s by Michael Ventris (see Ventris and Chadwick 1973). 
The Indo-European family of languages contains several 
hundred languages (depending on classification) and is the 
largest such linguistic grouping in the world, by number of 
speakers. The designation Indo-European is intended to (roughly) 
encompass most of the languages spoken in the lands of Europe 
and Southwest Asia.* The major subfamilies are (in order of 


earliest written attestation): Anatolian, Indic, Iranian, Greek (or 


* The German term indogermanisch is somewhat more precise, since the 
westernmost territory (namely, Scandinavia and Iceland) is inhabited 
by speakers of Germanic languages. (The equivalent term Indo-Germanic 
is now outdated in English.) The extinct Tocharian languages are a 
geographical outlier, having been spoken in the modern Chinese 
province of Xinjiang (in central Asia). A few languages in Europe are 
non-Indo-European, notably the Finno-Ugric languages Estonian, 
Finnish, and Hungarian. Basque (Spain and France) is a famous lan- 
guage isolate, while Maltese is a Semitic language. 
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Hellenic), Italic, Celtic, Germanic, Armenian, Tocharian, Slavic, 
Baltic, Albanian. ° Several Indo-European languages enjoy a 
lengthy period of written attestation (rivalled only by some 
Semitic languages). The earliest attested written records are from 
the extinct Anatolian family (spoken in present-day Turkey), in 
particular, Hittite, dating back roughly to the Late Bronze Age.° 

Early scholarly efforts at reconstructing Proto-Indo- 
European were influenced by the prestige of Sanskrit, the classi- 
cal language of India (equivalent in status to that of Latin in 
Europe). This led scholars to reconstruct Proto-Indo-European 
with broadly similar features, especially in its vowel system. Sub- 
sequent study revealed that Greek, with its inherited five-vowel 
distinction, was in fact more archaic than the three-vowel distinc- 


tion of Sanskrit.” 


° Several larger subgroupings are typically made by scholars. The most 
clear is Indo-Iranian (Indic together with Iranian). Most scholars would 
also group together Baltic and Slavic (Balto-Slavic). Much more contro- 
versial is Italo-Celtic. Additionally, certain shared isoglosses lead many 
scholars to group together Balto-Slavic and Germanic in a northern 
dialect group, as well as Greek, Indo-Iranian, Armenian, and the poorly 
attested Phrygian into a southern dialect group. 


° The earliest attested record of any Indo-European material is found in 
the form of Hittite loanwords in Old Assyrian texts from Kani§, dating 
to the Middle Bronze Age (18th century BCE). See Kloekhorst (2019). 


7 Indo-Iranian (the sub-branch in which Sanskrit is located) distin- 
guishes three vowel qualities ({ d, u), with *é and *6 having merged to 
a. Note that the asterisk symbol is universally used in historical linguis- 
tics to denote a linguistic reconstruction that is prehistoric (i.e., prior to 
written records). 
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A major milestone in Indo-European studies was the recon- 
struction of a class of consonants called laryngeals, first hypo- 
thesised by Ferdinand de Saussure (1879). His rather brilliant 
reasoning is akin to the method now known as internal recon- 
struction.® The empirical existence of these consonants was con- 
firmed by the discovery of Hittite (of the Anatolian branch) in 
the early twentieth century. The laryngeals were partly preserved 
intact in Anatolian, while in other branches they were lost, 
although they left various important traces (such as adjacent 
vowel colouring, compensatory lengthening, and other effects). 
Most scholars today reconstruct three laryngeals, conventionally 
designated *h,, *h,, *h,. Their actual phonetic values are still 
unknown, though many scholars hypothesise that they were 


fricatives of some type, perhaps glottal and pharyngeal.? 


8 In retrospect, one can see a kinship between Saussure’s later 
influential ideas on structuralism and the reasoning he employed here. 


° An illustration of the crucial role that laryngeals play in Indo- 
European linguistics is seen, for example, in the interplay of 
morphological vowel alternations (called ablaut) and root structure in 
the singular and plural perfect stems of verbs like Azimw ‘leave’ (e.g., Aé- 
Aowr-a and Aé-Ait-ouev). Notice that the singular features an o-grade 
vowel in the root, whereas the plural lacks this. In another pair of forms, 
such as 6€-dw-xa (1 sg.) and dé-do-yev (1 pl.), the difference observed is 
a vowel length distinction. However, with the postulation of a laryngeal 
for this second verbal root, the morphological symmetry is restored: 
*de-doh,- (sg.) ~ *de-dh,- (pl.). The loss of the laryngeal was not without 
effects: compensatory lengthening in the singular stem (*de-doh,- > 
dedw-) and vowel epenthesis in the plural stem (*de-dh,- > ded0-). 
Compare the reconstructed stems for AéAoima ~ AeAimoper: *le-loik”- (sg.) 
~ *le-lik”- (pl.). 
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A landmark multi-volume work appearing towards the end 
of the nineteenth century (1886-1900) is Brugmann and Delbriick’s 
Grundrifs der vergleichenden Grammatik der indogermanischen 
Sprachen. This represented the state of current knowledge at the 
time, prior to the discovery of Anatolian and Tocharian early in 
the following century. As it turns out, a considerable amount of 
time would pass before the insights of Tocharian, and especially 
Anatolian, were to alter the picture of Indo-European. 

In recent decades it has become clear that Anatolian was 
the first to break off as a separate speech community (followed 
next by Tocharian). Some of the morphologically rich categories 
common to (Vedic) Sanskrit and Greek are now thought to have 
been later innovations shared by the core inner-IE branches after 
the breaking off of Anatolian (which appears to have lacked 
them). 

A major ongoing area of study is the history and status of 
the so-called -hi and -mi conjugations in Anatolian and Indo- 
European. In core Indo-European, only the familiar -mi conjuga- 
tion (cf. Gk didwut ‘give’) is attested, though, interestingly, the 
long 1 sg. ending -w of thematic verbs is standardly viewed as 
going back to *-o-h, (with the laryngeal ending ultimately related 
to the -hi conjugation).'° Scholarship has demonstrated that the 
perfect tense-stem and the middle voice are clearly historically 


linked to the Anatolian -hi conjugation. The exact relationship is 


1° The so-called thematic verbs (much more common in Greek) belong 
to the same class as -mi verbs; compare the endings of Gk didwu: with 
the thematic verb bhar-a- ‘bear; carry’ in Sanskrit: 1 sg. bhdr-a@-mi (from 
earlier *b"ard, cognate with Gk dépw), 2 sg. bhdr-a-si, 3 sg. bhdr-a-ti. 


180 Ozoling 


controversial and has been much debated. A comprehensive 
theory of the Indo-European verb, developed by Jay Jasanoff 
since the 1970s, has been steadily gaining adherents (see Jasanoff 
2003). 


1.2. Semitic Studies 


Hebrew and Aramaic, the languages of the Old Testament, belong 
to the Semitic family of languages, itself part of a larger grouping 
called Afro-Asiatic (which links Semitic with a number of lan- 
guages spoken in northern and central Africa).’' The immediate 
sub-branch to which both Hebrew and Aramaic belong is desig- 
nated Northwest Semitic. Also classified as Northwest Semitic is 
Ugaritic, an extinct language spoken at Ugarit, a city state on the 
northern Levantine coast.’? 

Semitic linguistics has been shaped by particular con- 
straints and methodological challenges. Semitic languages are 
characterised by a distinctive concatenative morphology, which 
features (mostly) triconsonantal verbal roots (C,-C,-C;) com- 


bined with particular vowel patterns. For example, k-t-b ‘write’ 


" Afro-Asiatic linguistics is still in its infancy, and many of the non- 
Semitic languages are poorly known and poorly attested. Additionally, 
the specific linguistic affiliation of the sub-branches is controversial. 


Discovered by accident in 1928, the archaeological site of Ugarit (Tell 
Ras Shamra) has yielded thousands of cuneiform tablets and other 
significant artefacts. Ugaritic studies over the past century has offered 
important linguistic, cultural, and theological insights into the Old 
Testament. For a broad introduction to Ugaritic studies, see Watson and 
Wyatt (1999). 


Historical Linguistics 181 


combined with a particular vowel pattern to form a verbal adjec- 
tive in Proto-Semitic: *katib- (Hebrew qal ana, Aramaic pe‘al 1n3, 
Ugaritic G-stem katibu, Arabic form I katibun). A different vowel 
pattern with the same root yields a different grammatical form, 
e.g., the G-stem infinitive *katab-. Unfortunately, the reconstruc- 
tion of the various vocalic patterns for both Proto-Semitic and 
intermediate branches is complicated by the fact that many 
ancient Semitic writing systems tended to leave out vowels. The 
writings of the Old Testament, for example, were originally 
written with a (largely) consonantal writing system, which was 
only later supplemented by the medieval Masoretic notation 
system of vowels (niqqud). 

For example, the exact vocalisation of the Ugaritic word sd 
‘field’ must be ascertained from other evidence (since the native 
Ugaritic orthography is largely consonantal). Its reconstructed 
form in Proto-Semitic featured an intervocalic glide (between the 
second root vowel and the case vowel): *sadayum. The outcome 
of these so-called triphthong sequences varied from language to 
language.'? In Biblical Hebrew, for example, III-n nouns (origi- 
nally derived from III-y/w roots) feature word-final segol in 


the absolute (e.g., 77 ‘field’), with regular loss of final *m 


13 The term triphthong (derived from diphthong) is somewhat of a misno- 
mer, as these sequences are better described from the standpoint of 
Semitic phonology as two vowels separated by an intervocalic glide: 
V,wV, and V,yV>. 
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(originally used in Proto-Semitic to indicate unbound nomi- 
nals).'4 Fortunately, among the archaeological finds at Ugarit are 
numerous cuneiform tablets which contain lists of Ugaritic words 
spelled with the logosyllabic orthography peculiar to Akkadian. 
One such example is Sa-du-u ‘field’ (RS 20.123 +), indicating to 
us that the outcome of this triphthong sequence in Ugaritic was 
/-U/. 

Another challenge historical linguists face in dealing with 
Semitic languages has been determining linguistic subgroupings. 
The most important criterion for establishing a subgroup is 
shared innovations between languages. Unfortunately, the sig- 
nificant linguistic contact between Semitic languages in the rel- 
atively small Middle East region has meant that genuine shared 
innovations are often hard to distinguish from borrowings. For 
more detail about the criteria of shared innovations, as well as 
the two major models of linguistic affiliation (the tree and wave 
models), see §2.1.4. 

Important work by Robert Hetzron in the 1970s led to 
mainstream adoption of a Central Semitic node which, in the 
current standard view, includes three main sub-branches: North- 
west Semitic, Arabic, and Old South Arabian (Sayhadic). The 
primary shared innovation justifying this node was the replace- 
ment of the imperfective yaqattal verbal form with the yaqtulu 
form (the ancestor of the familiar yiqtol form in Hebrew). 

Some scholars have rejected Hetzron’s model, pointing out 


that there appears to be a set of shared features justifying a 


The less common biform "tw ‘field’ indicates the original III-y root 
shape. 
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‘south’ Semitic branch.'!° For example, Arabic, Old South Arabian, 
and Modern South Arabian, as well as Ethiopian, all exhibit a 
distinctive *p > f sound change. Furthermore, the phenomenon 
of so-called ‘broken’ (or internal) plurals is widespread among 
these languages, whereas it is less prevalent (and never occurs 
without an overt plural suffix) in Northwest Semitic. Huehnergard 
and Rubin (2011) argue persuasively that these (and other) fea- 
tures are in some cases trivial, while in other cases they provide 
evidence for intensive contact between the ‘southern’ languages 
after the split of Central Semitic from the rest of West Semitic.’® 

The status of some other sub-groupings in Semitic also con- 
tinues to be debated. Potentially of interest to biblical scholars is 
the ongoing discussion about the status of Aramaic, Ugaritic, and 
the Canaanite dialects (including Hebrew) within Northwest 
Semitic (see Pat-El and Wilson-Wright 2018). 


2.0. Key Theoretical Commitments and Major 


Concepts 


In historical linguistics, several crucial theoretical commitments 
are a necessary foundation for the investigation of language 
change in all its dimensions. These are described below, followed 


by an exploration of some of the field’s major concepts. 


'S The ‘south Semitic’ view was standard among earlier generations of 
Semitists. 

'© Kogan’s recent detailed lexical study (2015) of Semitic isoglosses, 
including between Arabic and Northwest Semitic, finds additional sup- 
port for a Central Semitic node. 
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2.1. Theoretical Commitments 


Any scientific discipline contains within it various assumptions 
(sometimes unstated or otherwise unknown to non-specialists) 
without which its specific concepts and methods could not 
be coherent or justified. Four such primary commitments are 
sketched here. 


2.1.1. Pervasiveness of Language Change across All Levels 


Human language is the object of all linguistic investigation. A sine 
qua non within the subfield of historical linguistics is the assump- 
tion that all natural human languages are everywhere and always 
subject to continuous and perpetual language change at all levels. 
Consider the somewhat idealised Figure 1 below. All subsystems 
within a given language (whether phonetics, phonology, etc.) are 
subject to change over time.'” This process is generally gradual 
(although it is not always uniform in intensity), such that speak- 
ers do not generally perceive the change to be very great within 
their own lifetimes. Over a larger timespan, however, such as 
centuries or millennia, the accumulated changes can result in 


drastic differences. 


17 This neat segmentation is of course conceptual, and in reality all 
schools of linguistic thought acknowledge that the boundaries between 
language domains are highly permeable and interconnected (hence 
terms such as morphosyntax, pragmatico-semantic, etc.). 
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Figure 1: Multi-level language change 


jQ]e —-o SESS —h 


Language states 


Phonetics, 


Phonetics, 


Phonetics, 


Phonology, Phonology2 Phonology, 


Morphology, Morphology, Morphology,, 


Syntax, Syntax, Syntax, 


Historical linguists assume that any natural language—even if 


poorly understood or as yet unknown—will be subject to this 
phenomenon. '® 


2.1.2. The Regularity of Sound Change 


The study of sound change lies at the heart of the discipline of 
historical linguistics. One key reason for this is the regularity of 
sound change in all human languages—a far-reaching theoretical 
commitment. Early nineteenth-century observers of language 
change (e.g., Rasmus Rask, August Schleicher, Franz Bopp, etc.) 


8 In fact, one of the distinguishing factors in classifying languages as 
‘living’ (as opposed to ‘extinct’ or ‘dead’) is whether they are (or were) 
in the process of undergoing change. Nevertheless, the classification of 
a language as ‘dead’ is somewhat controversial, given phenomena such 
as the widespread use of Latin in medieval Europe, for example. 
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noted systematic sound correspondences between sets of cognate 
vocabulary, leading them to postulate consistent sound laws to 
account for these differences (see §2.2.3. on the comparative 
method). The consistency of the sound correspondences was seen 
to apply across the board in a given language (for example, the 
same sound change affected verbs, nouns, adjectives, etc., without 
exception). The regularity of sound change is largely what makes 
linguistic reconstruction of prehistoric language states possible. 
In its strongest form, this claim is linked to the nineteenth- 
century German Neogrammarians (Ger. Junggrammatiker), espe- 
cially expounded in Osthoff and Brugmann (1878). 

The actual degree of regularity of sound changes is a topic 
of some controversy in historical linguistics. Some types of pro- 
cesses (for example, dissimilation and metathesis) appear to be 
less than fully regular. Closely tied to this issue is the question of 
the actual locus of sound change. Does it take place mainly 
during the language acquisition process of children? Or is it 
rather to be located in fluent adult speakers? Or both? What is 
the relationship and interplay between ‘regular’ sound change 
and other types of change in language (especially analogy)? For 
our purposes, it is enough to note that most scholars accept a 
general regularity in language sound change (what could be 


perhaps termed ‘qualified exceptionlessness’). 


2.1.3. Linguistic Reconstruction from Sound Change 


Linguistic reconstruction is the systematic recovery of chrono- 
logically earlier language states (often thousands of years earlier). 


This process includes proto-languages, intermediate daughter 
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branches, and all other intermediate language states. The end 
goal of linguistic reconstruction is to provide as comprehensive a 
history as possible for a given language within its language 
family. This activity is a constant work-in-progress for scholars 
engaging with each language family, and in some cases, scholar- 
ship has been working towards this goal for centuries. 

Although all levels of language are subject to change over 
time as seen above, the investigation of sound change (phonetics 
and phonology) in language is agreed to be the major starting 
point and the foundation for the entire enterprise of linguistic 
reconstruction. This is partly due to the regularity of sound 
change, discussed above, but there are additional reasons. Lan- 
guage subsystems exist in a roughly hierarchical arrangement, 
hence sound change naturally becomes the foundation for other, 
higher-level linguistic reconstruction. This is because higher lan- 
guage domains (such as morphosyntax) ultimately terminate at 
the phonological/phonetic domains. 

All syntactic constructions, for example, consist of a dis- 
crete sequence of morphemes. Each morpheme in turn consists 
of a discrete sequence of phonemes (contrastive speech sounds 
within a given language).’® Finally, each phoneme is an abstract 
representation of a set of phones (the final, phonetic domain), 
whose distribution in the language is governed by complex lan- 


guage-specific rules. 


? Naturally, some morphemes in languages can consist of only a single 
phoneme; but a conceptual distinction between the two is still 
necessary. 
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As an example, Figure 2 below illustrates the hierarchical 
relationships in the Classical Greek present participle y:yvoueva 
‘the things taking place’.*® Notice that the complete word (level 
IID) consists of five discrete morphemes in the morphological 
level (ID, of which the first, a present-stem reduplicant gi-, is 
especiallly noteworthy.”! Finally, the five morphemes comprise 


nine distinctive phonemes in the phonological level (1). 


Figure 2: Morphological hierarchy of ytyvoueva 


Ill ylyvoueva 


II gi- -gn- -0- -men- -a 


I /g/ Jif /8/ /n/ Sof sms Sef Sans Sa/ 


However, a consequence of this bottom-up schema is that any 
sound changes will have cascading consequences for higher-level 
domains. For example, the historic loss of the preconsonantal 
phoneme /g/ in post-classical Greek (/gignomena/ > /ginomena/) 
occasioned compensatory lengthening of /i/ > [i:]. This is 


evidenced by the consistent spelling of {1} in this and related 


° This schema is necessarily simplified, and therefore does not capture 
the internal hierarchical structures at each of the three levels. 


21 Not to be confused with the perfect-stem reduplicant C,e- (cf. yéyova). 
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present-tense forms in the biblical manuscript tradition, e.g., 
Luke 9.7: P”? & D {yewopeva}.”? 

This change in the language, although originating at the 
phonetic/phonological level, has consequences for higher levels 
of the language. In place of a clear phonological boundary 
between the present-stem reduplicant gi- and the root -gn-, one 
observes a new opaque present-stem gin-. Speakers would now 
internalise a quasi-suppletive set of verbal stems (e.g., present 
gin- [yetvetat], aorist -gen- [éyéveto], perfect -gon- [yéyove]). This 
new structure is depicted in Figure 3 below. 


Figure 3: Revised morphological hierarchy of yryvoweva 


Ill yelvoueva 


II gin- -0- -men- -a 


I /8/ (NS (nf JO /m/ eS nf Sa/ 


From this illustration one sees the foundational role that sound 
change plays in linguistic structures, as well as how it enjoys 
a certain priority in historical linguistics. Although language 


change can occur in any language domain, the most secure and 


2 See Williams (2018) for a defence of the view that {1} ~ {«1} spelling 
interchanges are not uniformly haphazard, as is usually assumed (part 
of a larger phenomenon known as ‘itacism’). 
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logical foundation for linguistic reconstruction is the phonetics/ 


phonology domain (i.e., sound change).”° 


2.1.4. Language Branching and Speech Communities 


A language or dialect is a more or less unified speech community 
with a set of shared (though unconsciously accepted) conventions 
that constitute a particular grammar. A grammar may be con- 
ceived of as a collection of form-and-meaning pairings. 

The language of such a speech community (say, speech 
community x) undergoes innovations over time, such that all of 
its members (= speakers) essentially hold together linguistically. 
Suppose, however, that a separation takes place within this 
speech community, such that in place of x there are now two 
separate speech communities, y and z (typically separated by 
geography). An important consequence of this development is 
that all subsequent language change in speech communities y and 
z will be distinctive, unless language contact is reestablished 
between the communities. This is the central concept known as 
language branching. See Figure 4 below for a standard model of 


Northwest Semitic. 


3 In practice, this usually means that the most developed linguistic 
reconstructions tend to be phonological in nature, followed by morpho- 
syntax and semantics (though these other, less well-developed areas of 
study in Indo-European historical linguistics have experienced signifi- 
cant advances in recent decades). 
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Figure 4: Standard model of Northwest Semitic 


Proto-Northwest-Semitic 


a 


Aramaic Canaanite Ugaritic 


Hebrew Phoenician Moabite Ammonite 


Over time, the iterative branching process yields a family ‘tree’, 
where each ‘twig’ (or dialect) is linked to a larger series of ‘bran- 
ches’ (or distinctive languages), while all branches ultimately 
converge into a single ‘trunk’ (the proto-language).”* While all 
speech communities are ultimately related to one another within 
a given family, a hierarchical structure relates each language in 
a more precise way that captures the individual relationships 
between any two members. 

Once separated, the different changes that two speech 
communities undergo are generally not predictable beforehand. 
Within a given language family, when speech communities are 
no longer in contact with one another, the changes they subse- 


quently undergo will not always be identical.*° The more time 


4 This specialised field, known as cladistics (from Gk xAddog ‘branch’), is 
also important for several other non-linguistic scientific disciplines, for 
example, textual criticism. 


5 An active topic of discussion among historical linguists is the degree 
to which linguistic trajectories can be mapped out, given the common 
starting point of separated speech communities. In part, this depends 
on whether language change is best characterised as prophylactic or 
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that passes, the more distinctive each speech community becomes. 
Here, the imprecise, yet useful, terms dialect and language apply. 
Given enough time, separated speech communities sharing a lan- 
guage will develop distinctive dialects. These dialects will grow 
progressively less mutually intelligible, until, at some point, they 
are conventionally considered distinctive languages.”® 

In reality, this neat model is complicated by the phenom- 
enon of language contact between distinctive speech commu- 
nities. Prolonged language contact (whether minor or intensive) 
can lead to new shared linguistic innovations over time. These 
innovations (or isoglosses) can create challenges in the estab- 
lishment of linguistic affiliations between the languages of a 
language family (as even in the example of Northwest Semitic; 
see §1.2). 

To capture this reality, an alternative (but complementary) 
wave model has been used by historical linguists to account for 
features and changes that spread throughout any speech commu- 
nities in contact with one another (whether they are linguistically 
homogeneous or not). The metaphor of a spreading wave (or 


better: ripples in a pond) nicely illustrates how language change 


reparative. In other words, does language change primarily occur in 
order to avoid certain phenomena that are unconsciously disfavoured 
by speakers, or, rather, does language change occur as a subsequent tool 
for repairing already existing changes which themselves are 
disfavoured? 


Something of the imprecision in these categories may be observed 
when considering that the Scandinavian languages (Swedish, Norwegian, 
and Danish) are largely mutually intelligible, yet the numerous dialects 
of Modern Arabic or the varieties of Chinese are not. 


Historical Linguistics 193 


is often much more complex than imagined. The wave model is 
popular with dialectologists, especially in the context of the study 
of a dialect continuum (or language continuum). Many linguists 
today do not view the tree and wave models as mutually exclu- 
sive, but in fact as complementary, the former accounting for 
genetic relationships, the latter accounting for subsequent exter- 


nal linguistic influences. 


2.2. Major Concepts 


Building on the previous section, the following is a necessarily 
brief exploration of some of the methods and tools used by his- 


torical linguists in their exploration of language change. 


2.2.1. The Role of Philology 


As we have seen, linguistics was birthed in the context of the 
study of ancient languages, which is described in part as philol- 
ogy. Nevertheless, historical linguistics eventually grew to be a 
separate scientific discipline in its own right, distinct from phi- 
lology. Consequently, the term philology nowadays more properly 
describes the various tools employed by scholars (not just lin- 
guists) in order to study ancient texts. Philology, which has been 
neatly encapsulated as the art of reading slowly, includes diverse 
disciplines such as archaeology, paleography, textual criticism, 
and other historical fields of inquiry. 

These tools are still highly relevant for historical linguists, 
since scholars prize the early written records of a language as a 


means of conducting linguistic reconstruction. Unfortunately, the 
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quality and accuracy of such data are uneven for the documenta- 
tion of a historically prior stage of a spoken language (irrespec- 
tive of whether that language is currently extinct or living). 
Knowledge about writing systems (paleography, etc.) is vital for 
understanding the conventions that native (and non-native) 
speakers chose in representing a given language. Furthermore, if 
an ancient text can only be accessed by scholars as the product 
of a chain of textual transmission (i.e., scribal copying), the 
discipline of textual criticism must be employed in order to sort 
out genuine linguistic data from non-linguistic scribal errors or 
supposed corrections that have arisen in the manuscript tradition. 
In fact, any written representation of language, whether 
ancient or modern, is always incomplete and imprecise due to 
various factors. To begin with, speakers do not require or depend 
on a precise phonetic (or even phonological) transcription of lan- 
guage in order to achieve comprehension. An illustration of vari- 
ous levels of transcriptional detail in the Masoretic system and 
the International Phonetic Alphabet (IPA) is given in the example 
below. 
(1) (a) tax Chis palace’; Dan. 11.45) 
Consonantal representation 
(b) i7T5x 
Approximate phonemic representation 
(c) /?ap:adno/ 
Broad (phonemic) IPA transcription 
(d)  iiTax 
Approximate suprasegmental representation 
(e) [?ap.pad.'no:] 
Narrow (phonetic) IPA transcription 
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Varying levels of detail and accuracy are presented for the hapax 
i75x ‘his palace’ (Dan. 11.45) in the example, on a spectrum from 
(a)-(e). Notice, however, that each non-IPA representation is not 
fully consistent (hence the designation ‘approximate’). This is an 
artefact of complex historical factors. For example, the consonan- 
tal representation is not fully consonantal, since it includes a final 
vowel letter (1-).”” Additionally, while the Masoretic system itself 
accounts for vowel quality and generally indicates supraseg- 
mental stress, it does not record vowel length, even though this 
is generally thought to have been at least partly phonemic in 
Tiberian Hebrew.” 

What is particularly interesting about this example, how- 
ever, is that it is plausibly one of a select few examples of a non- 
aspirated /p/ in the entire Masoretic reading tradition of the Old 
Testament. We know from Latin and Greek transcriptions that the 
normal pronunciation of 5 appears to have been not [p], but an 
Khan adduces testimony from the medieval grammarian Saadya 


Gaon for the hapax legomenon ii75x ‘his palace’ at Dan. 11.45, 


27 The use of vowel letters can be documented from the early first mil- 
lennium BCE, and it is generally thought that the initial point of contact 
was the monophthongisation of particular diphthongs in Northwest 
Semitic. Thus, for example, *mawt {mwt} > mot {mwt} ‘death’ (cons.), 
where w came to represent 6. 


?8 There has been something of a revolution in our understanding of the 
phonology of Tiberian Hebrew in recent decades, largely thanks to the 
detailed work of Geoffrey Khan. For an excellent brief overview of the 
history of scholarly interpretation of the Masoretic vocalic system, see 
Suchard (2018). 
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which he describes as “between bet and pe with dagesh” (Khan 
2020, 215). This appears to be an attempt to capture the hearer’s 
perception of a non-aspirated stop (since bet was not aspirated). 
Khan argues that the reading tradition for this loanword essen- 
tially inherited its non-aspiration from its Old Persian source 
(which lacked aspiration), probably with pharyngealisation, as a 
phonetic approximation of the loanword in Hebrew. Incredibly, 
there is even some corroborating evidence for this pronunciation 
from Latin and Greek transcriptions: e.g., atadavw (Theodoretus, 
fifth century AD).”° 

As can be seen from just one example, historical linguists 
rely heavily on the methods and findings of philology in order to 
do good linguistic work on language states that are inaccessible 
with traditional field work methodology. The soundness of their 
reconstructions and theoretical models in large part depends on 


sound philological work. 


2.2.2. Borrowing” 


Linguistic borrowing is in fact a pervasive feature of many dis- 
tinctive speech communities in human societies, since they are 
frequently in contact (sometimes extensive contact) with other 


language communities. This contact can take many different 


?° For the full argument, as well as more complete evidence, see Khan 
(2020, 214-20). 

3° This discussion on borrowing is deliberately placed prior to the sec- 
tions that follow because discerning and eliminating borrowed linguis- 
tic material is an important criterion for establishing legitimate cor- 
respondence sets when using the comparative method. 
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forms. Sometimes it is limited and sporadic; at other times and 
contexts it can be significant, such that a high level of bilingual- 
ism exists between two speech communities.*! The term borrow- 
ing generally refers to lexemes and semantic notions which are 
transferred from one speech community to another. (Note that 
this excludes other types of linguistic influence, such as syntax, 
which are no less genuine.) 

The actual process of borrowing, like with other diachronic 
aspects of language, is not instantaneous. A borrowed lexeme 
begins life as a recognisably foreign element that over time may 
become progressively integrated into a speech community, such 
that eventually its users will no longer perceive it as foreign, but 
perfectly native. When examining lexical material in a speech 
community, or even in an ancient text (say, that of the Bible), it 
is important to ascertain where on the scale of adoption a given 
lexical item is diachronically situated (at the time of its usage). 
Depending on the analysis, foreign words in a text might be 
described either as loanwords or alternatively as an example of 
code-switching. 

A helpful illustration for understanding borrowing is the 
so-called ‘Latinisms’ in the Gospel of Mark. As shown in Table 2, 


these include the following (adapted from Zeichmann 2017): 


3! Various complex factors (linguistic and non-linguistic) can play a role 
in determining the degree of contact, such as the linguistic relatedness 
of the two speech communities, their relative social ranking, and the 
degree of societal integration between them. 
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Table 2: Latin-Greek Borrowing 


Latin Greek References (GMark) 
grabatus ‘cot’ xpaBatos 2.4, 9, 11, 12; 6.55 
modius ‘a measure’ uddtov 4.21 
legio ‘legion’ Agytov 5.9, 15 
speculator ‘scout’ omexovAdtwp 6.27 
dénarius ‘a day’s wage’ dyvaptov 6.37; 12.15; 14.5 
pugnus ‘fist’ TUyEy 7.3 
sextdrius ‘a measure’ Eéotns 7.4 
cénsus ‘census’ KHVOOS 12.14 
Caesar ‘Caesar’ xaicap 12.14, 16,17 [2x] 
quadrans ‘a coin’ xodpavTns 12.42 
vae ‘woe’ ovat 13.17; 14.21 
flagelld ‘to whip’ bpayedrow 15.15 
praetorium ‘palace’ TPALTWPLOV 15.16 
centurio ‘centurion’ XEVTUPLOV 15.39, 44, 45 


The presence of so many Latin words in Mark’s Gospel has been 
taken by many commentators as fairly strong evidence for a 
Roman (i.e., Western) provenance for the writing. Other scholars, 
however, argue for a Syrian or Palestinian setting. Zeichmann 
examines the character of these Latinisms and concludes that 
they are poorly internalised borrowings. Some of the senses Mark 
uses for the Latin words are quite atypical (for example muy as 
‘hand’). The very presence of Latinisms most likely indicates a 
post-70 CE writing date if in Palestine (when the Roman occu- 
pation, the use of Latin, and the prevalence of Roman items, such 
as coins, increased dramatically). On the other hand, Mark’s 


rather rough and irregular use of Latin words may indicate only 
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a rudimentary acquaintance with the language, which is con- 
sistent with a Roman provenance.” 

Another application of studies in loanwords and linguistic 
borrowing for biblical studies is the use of loanwords to obtain 
relative dates for the composition of a text. If the historical time 
period during which cultural and linguistic contact took place 
between two speech communities is known, in theory approxi- 
mate dates of composition could be assigned to texts containing 
particular loanwords. For example, the presence of Greek loan- 
words in the book of Daniel has frequently been used as an 
argument in favour of a Hellenistic-era composition. A recent re- 
examination of the evidence by Benjamin Noonan (2018), 
however, argues that the phonological makeup of these Greek 
loanwords (mainly musical terms) indicates that they were bor- 
rowed from a non-Attic source (i.e., not the dialect which spread 
as a consequence of Alexander’s Hellenisation). This debate illus- 
trates that linguistic evidence can play a role in assessing the date 
of a text’s composition, but that one should be cautious in relying 


on such evidence for a conclusive decision (see §3.0 below).** 


32 Zeichmann himself favours a post-war Syrian or Palestinian prove- 
nance, though this is on the assumption that “the author of Mark was 
[not] a Judaean denizen writing from Rome or a refugee of the Jewish 
War” (Zeichmann 2017, 47). 


°3 A vivid illustration of such overconfidence is the famous assessment 
of S. R. Driver (1900, lxiii, emphasis original) with regard to the date 
of Daniel: “The verdict of the language of Daniel is thus clear... the 
Greek words demand, the Hebrew supports, and the Aramaic permits, a 
date after the conquest of Palestine by Alexander the Great.” 
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2.2.3. The Comparative Method 


The comparative method has already been mentioned, but in this 
section it is explained in more detail, especially as a process. This 
is one of the most important tools historical linguists make use of 
when conducting research into a family of languages. As its name 
suggests, the method involves a comparison of assembled lexical 
items between two or more genetically related languages. It is 
important to state at the outset that the method is necessarily 
iterative by nature, as several assumptions required by the 
method are themselves clarified in the course of implementing 
the selfsame method. In the initial phase, of course, it may not 
necessarily have been established that the languages in question 
are in fact related, as superficial resemblances are not enough to 
establish such a link (since they could be chance similarities). 
Hence, there is a necessary element of circularity in the method; 
an initial provisional hypothesis is usually sufficient to begin. The 
pervasive presence of a core set of vocabulary that appears to be 
shared between two languages is a valid starting point for the 
application of the comparative method. 

The first step involves the assembling of a list of lexical 
items which are plausibly cognate with one another. At this stage, 
it is critical to exclude any borrowed lexemes in any language, 
since they would inevitably skew the analysis (see the previous 


section). This, too, is not always easy to accomplish, as there is a 


34 Sometimes the similarities between two languages have been much 
obscured by many layers of sound change. For example, it was not 
immediately obvious to scholars that Old Irish was Indo-European until 
the meticulous work of Franz Bopp. 
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degree of circularity in ascertaining which lexemes are borrowed 
and which ones are inherited.*° Table 3 below shows a sample 
correspondence set between Hebrew, Aramaic, and Ugaritic.* 
Although of course these three languages are now known to be 
genetically related to one another, it is helpful to see exactly how 
this claim can be established. As one examines the list of corre- 
spondences, certain observations emerge. (In the following, only 
the voiced coronal consonants /d/, /z/, and /6/ will be exam- 
ined, though the correspondence sets above also reveal other pat- 
terns.) Not only do most of the words look fairly similar to one 
another, but particular patterns are evident. 

For example, sometimes all three witnesses agree in sharing 
the same consonant /d/: 07 | xn7 | dm ‘blood’. This is also found 
in witp | wtp | qd ‘holy’ and ty | “tax | smd ‘holy’, as well as 
n>q | xn>t | dit ‘door’.*” In another set, there is also agreement 
among the three languages with regard to /z/: nt | xnvt | zt ‘olive’. 
However, sometimes the languages disagree: nar | nat | dbh 


‘sacrifice’ and ant | 197 | hrs ‘gold’. (Further examination reveals 


35 In some cases, heavy lexical borrowing can eliminate much core 
vocabulary, which presents a unique set of challenges to scholars. One 
such example is Armenian, whose vocabulary is much more strongly 
shaped by Middle Iranian languages than English is by Norman French. 
It took quite a bit of careful work in the late nineteenth century, for 
example, to establish that Armenian constitutes a unique branch of 
Indo-European and that it is not an Iranian language. 


3° This example concerns the reconstruction of Proto-Northwest- 
Semitic; see Figure 4 in §2.1.4 above. 

37 Note that vowels are included in Table 3 only where there is evidence, 
but are omitted for verbs. 
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that in the last set, Ugaritic hrs must be removed from consider- 
ation as it is clearly cognate with a poetic equivalent pron in 


Hebrew; the cognate of 1n1 has apparently been lost in Ugaritic.) 


Table 3: Example cognate words in Hebrew, Aramaic, and Ugaritic 


BHebrew BAramaic Ugaritic 

a. am ‘gold’ ant ‘gold’ hrs ‘gold’ 
b. nt ‘olive’ JArm. xn" ‘olive’ at ‘olive’ 

c. nt dem. pron. *T rel. pron. d/d rel. pron. 
d. not ‘door’ JArm. xn97 ‘door’ dit ‘door’ 
e. apw ‘gate’ yan ‘door’ tér ‘gate’ 

f, 07 ‘blood’ JArm. x7 ‘blood’ dm ‘blood’ 
g. nat ‘sacrifice’ nat ‘sacrifice’ dbh ‘sacrifice’ 
h. wip ‘holy’ wtp ‘holy’ qd5 ‘holy’ 

i. wow ‘three’ nbn ‘three’ tlt ‘three’ 

de nnn ‘below’ ninn ‘under’ tht ‘under’ 
k. ay ‘rock’ 10 ‘mountain’ gr ‘mountain’ 
I; oy ‘image’ oy ‘image’ sim ‘image’ 
m. Ps ‘earth’ Prx/yrs ‘earth’ ars ‘earth’ 
n. mine (?) Tiwn ‘darkness’ elmt/zlmt ‘darkness’ 
O. Tay ‘yoke’ JArm. x7nx ‘yoke’ smd ‘team’ 
p. pnn ‘strike’ nnn ‘strike’ mhs ‘fight’ 


Once this is done, however, there remains a discrepancy between 
the /d/ of Aramaic and Ugaritic, on the one hand, and the /z/ of 
Hebrew, on the other. A final correspondence set, the relative/ 
demonstrative pronoun 71 | °7 | d/d reveals that Ugaritic appar- 
ently features two (dialectal) outcomes of the same segment. All 
these correspondences are then conveniently tabulated on a sep- 
arate chart (Table 4 below). The next step is the analysis of the 
sets of segments in order to propose a single reconstructed pho- 
neme per set. The full agreement between the three languages 
with regard to sets 1 (/z/) and 3 (/d/) allows us to hypothesise 
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that the immediate ancestor of the languages (Proto-Northwest- 
Semitic) featured a phonemic inventory with the same two seg- 
ments: *z and *d. 


Table 4: Segment correspondence in Hebrew, Aramaic, and Ugaritic 


Hebrew Aramaic Ugaritic 
Set 1 /2/% /2/r Z/2/ 
Set 2 /2/t /d/7 d/6/ ~d/d/ 
Set 3 /d/7 /d/7 d /d/ 


This is the most economical explanation, as it would be highly 
unlikely that each language independently innovated in the same 
direction, resulting in such correspondences. 

Set 2, however, presents some challenges. Which segment 
ought we to reconstruct? In the majority of cases, Ugaritic agrees 
with Aramaic in sharing a /d/ segment (over against the /z/ of 
Hebrew). The interesting case of the relative pronoun d /6/, how- 
ever, reveals that in at least some cases, the reflex of this corre- 
spondence set in Ugaritic was a voiced interdental fricative. A 
feature analysis (Table 5) shows that /6/, although a fricative, is 
not a sibilant like /z/, and is therefore situated midway between 
/d/ and /z/. Cross-linguistic patterns further show that it is quite 
common for interdental fricatives to become sibilants (e.g., *6 > 
/z/; viz. Hebrew), or else to be ‘strengthened’ to a stop (e.g., *d 


> /d/; viz. Aramaic). 
Table 5: Feature analysis of /z/, / 6/, and /d/ 


Coronal | Voiced | Sibilant | Stop 
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As there is nothing apparent in the environment of these words 
(from the correspondence sets) to motivate a change in Aramaic 
or Ugaritic, we must instead hypothesise that an unconditioned 
merger took place in Hebrew between *6 and *z (> /z/) that is 
absent in Aramaic. Correspondingly, a separate unconditioned 
merger must have taken place in Aramaic between *6 and *d (> 
/d/). The reconstructed segments in each of the correspondence 


sets are given below in Table 6 in the left column. 


Table 6: Reconstructed segments of *z, *6, and *d 


Hebrew Aramaic Ugaritic 
*Z 1/z/ /2/ 2/2/ 
*6 /2/ w/d/ d /6/ ~ d/d/ 
*d w/d/ s/d/ d/d/ 


The results of this analysis indicate that Proto-Northwest-Semitic 
featured a series of three voiced coronal phonemes, which were 
reduced to two in Hebrew and Aramaic (though in different 


ways). 


2.2.4. A Typology of Sound Change 


The previous discussion illustrated the importance of understand- 
ing the phonology of the segments in correspondence sets assem- 
bled between cognate languages. Additionally, knowledge of com- 
mon patterns of sound change in other languages provides valua- 
ble information for the comparative method. This section pro- 
vides a brief overview of various types of phonological processes. 

A pervasive phenomenon in sound change is lenition, or 
consonantal weakening. In general, consonantal segments fre- 
quently tend to change along certain common pathways, poten- 


tially leading to segment loss. In table 7 below, three different 
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places of articulation (bilabial, alveolar, and velar) are arranged 
in parallel to show common pathways towards © (though other 
pathways are also possible). For example, a common process in 
languages is for geminated segments to be degeminated. Voice- 
less (non-geminated) stops will often become voiced in the envi- 
ronment of surrounding vowels (which are usually voiced in most 


languages). 


Table 7: Pathways towards consonantal weakening 


Bilabial | Alveolar Velar 

Geminate /p:/ /t:/ /k:/ 
Degemination /p/ /t/ /k/ 
Voicing /b/ /d/ // 
Spirantisation /B/ /6/ /y/ 
Debuccalisation /h/ /h/ /h/ 
Segment loss D D D 


The final stage prior to segment loss is called debuccalisation, in 
which the oral segment loses its original place of articulation, 
moving to the glottis (/h/).** 

Consider an example of lenition from Biblical and Byzantine 
Greek. The spelling of StAovavot (m. gen. sg.) ‘Silvanus’ at 2 Cor. 
1.19, a loanword from Latin Silvanus /silwa:nus/, varies some- 
what in the manuscript tradition (in which we also find lunate 
sigma): 

(2) (a) ctAovavou 
P46°xABCKLP ¥ 1739 rell 
(b) ctABavou 
P46* DFG 


38 A far less common cross-linguistic process than lenition is its reverse, 
called fortition. The change of *6 > /d/ in Aramaic is one such example. 
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The spelling ctABavou in (2b) indicates that for some later Byzantine 
scribes, the phoneme represented by {6} had already shifted from 
a voiced stop /b/ to a fricative /B/, which was perceived as the 
closest rendering of the Latin proper name (with /w/).°? 

Another common category of sound change is the loss of 
vocalic segments. Syllables which have lost articulatory promi- 
nence in words tend to lead to vowel reduction, and finally loss, 
which is called apocope. The loss of word-internal vowels is called 
syncope. A pervasive example of apocope in Hebrew is the loss 
of old Semitic case vowels (e.g., nominative *malk-u, genitive 
*malk-i, accusative *malk-a, all losing their final vowel in 721 
‘king’).*° 

The juxtaposition of a consonantal segment next to a follow- 
ing sound articulated close to the palate of the mouth (whether a 
vowel or consonant) leads to a common sound process in many 
languages called palatalisation. For example, in Pre-Greek, the co- 
occurrence of certain consonants before the palatal glide /j/ 
(usually spelled as a y in English) led to a series of widespread 
sound changes, including changes in place of articulation (/l/ > 
/\/, with a palatal co-articulation), changes in manner of artic- 


ulation (stop > affricate), among others. The examples in (3) 


3° The IPA symbol for a voiced bilabial fricative is a Greek beta. 
Manuscript data taken from Royse (2007, 852). Note that the use of 
{ov} is itself an attempt to render Latin /w/ in Greek. 


“© An intermediate step was the form *malk, which subsequently 
developed an anaptyctic vowel to break up the final consonant cluster 
-Ik. This widespread phenomenon in Hebrew is termed segolisation (after 
the name of the final vowel seen in the so-called segolates). 
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below in Greek are illustrations from various present formations 
with the Indo-European *-jo- suffix (contrasted with aorist): 
(3) (a) “*bal-jo- > *balo- > *bakio- > Bd))w ‘throw’ 
(present); *(e-)bal- > éadov (aorist) 
(b) *klep-jo- > *klepio- > *kleptfo- > xAémtw ‘steal’ 
(present); *(e-)klep-sa- > éxAea (aorist) 
(c) *ptulak-jo- > *p*ulakio- > *ptulatfio- > dvdAdcow 
‘guard’ (present); *(e-)p"ulak-sa- > épvraka 
(aorist) 

A very common process in sound change is the assimilation 
of a segment (or segments) in part or in whole to the features of 
an adjacent segment. For example, in Biblical Hebrew hitpa“el 
stems, certain features of the root-initial segment spread back- 
wards to the alveolar t of the stem prefix. This is referred to as 
regressive assimilation (the reverse is called progressive assimila- 
tion). For instance, the verbal root 127 formed a hitpa“el with 
regressive voicing assimilation: e.g., *mit-dabir- > *mid-dabir- > 
Nata (Num. 7.89). 

Sometimes this assimilation in hitpa“el stems was combined 
with yet another (often seemingly sporadic) process called meta- 
thesis, which involves the exchange of two or more segments, 
sometimes even when they are non-adjacent. This is a well-known 
phenomenon in hitpa“el stems with roots which begin with a 
sibilant. For example, the verbal root pty ‘to be just’ formed a 
hitpa“‘el in which the emphatic feature of the root-initial sibilant 
spread to the adjacent dental of the prefix (assimilation). Addi- 
tionally, the emphatic sibilant was swapped with the dental pre- 
fix: *nit-sadiq- > *nistadiq- > pToxi (Gen. 44.16; pausal form). 
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2.2.5. Analogy 


In some ways, the two great ‘engines’ of language change are 
sound change and analogy (often competing with one another in 
terms of their effect on a given language). Analogy may be 
described as the process whereby speakers, in perceiving simi- 
larities (potentially of various kinds) between two linguistic 
structures, proceed to further advance the similarities in other 
aspects of the same linguistic structures. 

For example, some scholars hold that the weqgatal form in 
Biblical Hebrew is perhaps an analogical development drawing 
from the wayyiqtol. When final short vowels were lost, the Semitic 
jussive-preterite form yaqtul merged in most contexts with the 
imperfective yaqtulu. In Biblical Hebrew, the old (short) yiqtol 
lives on mainly in the wayyiqtol form. The imperfective (long) 
yiqtol, by contrast, is used to indicate various modal nuances, as 
well as imperfective aspect. Later speakers, unaware of this ear- 
lier development, could have perceived in the prefixed waw of 
the wayyiqtol as having a ‘converting’ effect, transforming an 
imperfective yiqtol into a perfective preterite (hence the waw- 
conversive terminology in some Biblical Hebrew pedagogy). The 
analogy comes into play when speakers took the suffix conjuga- 
tion form, prefixed it with waw, and ‘converted’ its meaning from 
perfective to imperfective (i.e., the inverse of wayyiqtol). Thus the 
wayyiqtol, on this analysis, represents a direct historical develop- 
ment of the original jussive-preterite yaqtul, while the weqatal 
represents an analogical development that was constructed on 


the model of wayyiqtol. 
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When engaging in the task of linguistic reconstruction, it is 
critical to distinguish between genuine historical sound changes 
and various analogical effects (which sometimes even serve to 
undo particular sound changes in languages). One common form 
of analogy is paradigm levelling. Speakers over time tend to reduce 
inflectional distinctions in verbal or nominal paradigms, usually 
triggered by various kinds of analogy. For example, the 3mp pre- 
fix conjugation in Ugaritic features a t- element (thus in contrast 
to the y- of Biblical Hebrew). An example may be cited from a 
letter, KTU 2.63: ilm tgrk tslmk ‘may the gods (mp) guard you 
[and] keep you’. Notice that both verbs have a masculine subject 
(ilm ‘gods’), yet feature a t-prefix. Likewise, an Amarna Canaanite 
letter (EA 252) contains an unambiguous example of a 3mp prefix 
conjugation form with a t- element: ti-ma-ha-st-ka ‘they strike 
you’. 

This feature (from second millennium BCE Amarna Canaanite 
and Ugaritic) is used by Robyn Vern in part to argue against the 
view that there is such a thing as ‘archaic’ Biblical Hebrew Poetry. 
For her, one would need to demonstrate the presence of traces of 
this feature (among others) in Biblical Hebrew passages that are 
presumed to be archaic in order to sustain such a claim: “If there 
is sustainable evidence that there are second millennium rem- 
nants of the 3mp t-preformative, these poetic texts may be con- 
sidered of second millennium typology with regard to this par- 
ticular linguistic feature” (Vern 2011, 203). 

Yet a consideration of two alternative models for the recon- 
structed paradigm of the prefix conjugation in Northwest Semitic 


indicates that the change of *ya- — ta- is analogically justified 
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with Reconstruction B, while the reverse is manifestly not under 
either A or B.*' It seems that paradigm levelling in Amarna 
Canaanite and Ugaritic was brought about by the pressure to 
unify all plural prefixes (apart from 1c) under ta-, a scenario 
favouring Reconstruction B. See Figure 5 below for an illustration 
of this mechanism. 


Figure 5: Reconstructed paradigm of the prefix conjugation in 
Northwest Semitic 


Reconstruction A Reconstruction B 


3m *ya-qtul-ti *ya-qtul-t 
3f *ya-qtul-na *ta-qtul-na 

2m *ta-qtul-u *ta-qtul-t 
2f *ta-qtul-na *ta-qtul-na 
1c *na-qtul *na-qtul 


In considering two alternative reconstructions, historical lin- 
guists generally favour paradigmatic heterogeneity (all things 
being equal). 

A final type of analogy to be discussed is folk etymology. A 
classic example in Biblical Hebrew is the traditional rendering of 
nin>x ‘shadow of death’, which occurs eighteen times in the Old 
Testament (most famously in Psalm 23.4). This understanding is 
based on a morphological segmentation >yx ‘shadow’ + nin 
‘death’. Such an interpretation is already seen in the ancient 
translations, e.g., the Septuagint (oxida tot Oavatov ‘shadow of 
death’) and the Targums (xnia7 xd1w ‘the shadow of death’). Its 


*! Some scholars make use of an arrow (—) or a double greater-than 
sign (>>) in order to distinguish analogical change from sound change 
(>). Note that Reconstruction A is based on the homogeneity (with *ya- 
) exhibited by all major, early languages. 
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parallels in poetry, however, imply that the word is somehow 
associated with darkness, which is not a necessary interpretation 
of the phrase ‘shadow of death’. An apparent cognate in Ugaritic, 
galmt ‘darkness’ (variant spelling zlmt), suggests that nin>y should 
probably be segmented as a derivative in -ut from an unattested 
stative root Dox* ‘to be dark’ (cf. Akkadian salamu ‘to be dark’). 
One can grasp by this example how folk etymology works. It 
occurs when speakers accidentally reinterpret a word or a phrase 
by morphological resegmentation. The original word niny* 
‘darkness’ would have been reanalysed as by ‘shadow’ + nin 
‘death’, leading to sound change (ni- > ni-) that appears to be 


irregular. 


2.2.6. Other theoretical concepts 


Space precludes discussion of the many other kinds of language 
change (including syntactic change and semantic change), as well 
as additional methods historical linguists use to recover such 
changes (for example, internal reconstruction). As with many 
other fields of linguistics, historical linguistics is a broad field, 
with many sub-disciplines. Scholars continue to make progress in 
a number of increasingly specialised fields, while at the same 
time interdisciplinary approaches are growing in popularity, 
even drawing on fields outside of linguistics (for example, 
the recent application of Bayesian phylogenetics to language 


classification). 


212 Ozoling 


3.0. Historical Linguistics in Biblical Studies 


The methods and tools of historical linguistics have been applied 
to biblical studies with ever growing intensity over the past few 
decades. The main area of application by far has been the Old 
Testament, which by its nature is arguably more amenable to 
such research. Despite this, improvement in work done in both 
the Old and New Testaments is a desideratum, as some of it can 
be uninformed at times about a number of generally accepted 
assumptions of historical linguistics. 

Within Old Testament studies, historical linguistics has 
been applied most significantly (and increasingly controversially) 
to the question of the dating of its various texts. Fassberg (2016) 
offers an excellent overview of the history of scholarship in this 
area. He outlines three major time periods, the first beginning 
in the nineteenth century with the publication of Gesenius’ 
Geschichte der hebrdischen Sprache und Schrift (1815) and con- 
cluding with the publishing of the first fragments of Ben Sira in 
1896 by Solomon Schechter. The second period continues with 
the incorporation of the insights about the language of Ben Sira, 
as well as that of the Damascus Document (published in 1910), 
until the discovery of the Dead Sea Scrolls (DSS) in 1947. The 
third period continues unto the present day, as the data about the 
DSS and especially the language contained in its non-biblical 
texts continues to be analysed. Especially noteworthy are the 
rigorous methodological principles for dating texts developed by 
Avi Hurvitz and continued by his students. A significant mono- 
graph that represents the cumulation of this research is Hurvitz’s 
lexicon of Late Biblical Hebrew (2014). 
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Rezetko and Young (2019) argue persuasively that a fourth 
period may be discerned, which began in the 2000s. Within this 
current period, the heretofore accepted practice of dating biblical 
texts by means of linguistic features began to be seriously chal- 
lenged by a number of scholars, most particularly in a series of 
works by Ian Young, Robert Rezetko, and Martin Ehrensvard 
(Young, Rezetko, and Ehrensvard 2008; Rezetko and Young 
2014). Their work has generated intense debate and fresh research 
in this area. In the last decade, several major monographs on the 
question of diachrony in biblical Hebrew have emerged (e.g., 
Miller-Naudé and Zevit 2012; Moshavi and Notarius 2017). 
Although a definitive consensus has not yet emerged from all the 
debate, the discussions have fostered much more methodological 
caution in deriving actual dates (or even rough time periods) 
from the results of investigations into the language of particular 
texts of the Old Testament. To take but a single example, consider 
the presence of the apparent Aramaism 11m ‘they recount’ (Judg. 
5.11) in the linguistically archaic Song of Deborah and Barak.” 
The presence of Aramaisms in Biblical Hebrew texts is typically 
taken to be evidence of a late date of composition. Yet this is 
probably an oversimplification, since there would have been 
early language contact between Aramaic-speaking peoples and 
those of Canaanite stock (such as Biblical Hebrew speakers). Such 
forms indicate that care must be taken in assigning dates to the 


language of biblical texts. 


*” The Old Greek rendering dwcovcw ‘they give’ appears to be an 
alternative vocalisation reflecting an expected 11m. 
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An overall assessment of the debate can be briefly offered 
here. It seems that the confidence exhibited by an earlier genera- 
tion of scholars has been tempered somewhat. (One thinks of 
Driver’s [1900, lxiii] famously confident assertion about the dat- 
ing of Daniel over a century ago, also mentioned in §2.2.2 above: 
“The verdict of the language of Daniel is thus clear... the Greek 
words demand, the Hebrew supports, and the Aramaic permits, a 
date after the conquest of Palestine by Alexander the Great” 
[emphasis original].) Even if absolute dates for language states 
could be established, there is no exact correspondence between 
language states and ancient biblical texts, which are composite 
documents received via centuries of textual transmission. 

Furthermore, the contributions of scholars such as Gary 
Rendsburg have helpfully introduced the dialectal dimension to 
the diachronic debate: the language variation we see in the Old 
Testament text should not only be attributed to diachronic fac- 
tors. To this might also be added other sociolinguistic phenomena 
such as register and even an author’s idiolect. 

At the same time, the extreme scepticism of Rezetko, Young, 
and Ehrensvard seems unwarranted. As Aaron Hornkohl (2017, 
75) notes: “We may... reasonably accept that BH has a history 
and that the general lines of this history may be traced in the 
Masoretic editions of biblical literature.” The textual tradition for 
the Old Testament is reasonably secure (certainly for the pur- 
poses of detecting diachronic change). The Masoretic tradition 
has been demonstrated to have faithfully preserved minute lin- 


guistic nuances (even comparing favourably to many earlier texts 
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from the Dead Sea, which exhibit a degree of linguistic updating 
and harmonisation). 

The scholarship of Avi Hurvitz and his students has pro- 
duced fairly rigorous criteria for evaluating potential diachronic 
changes. For example, the criterion of linguistic opposition estab- 
lishes evidence for real linguistic change in the complementary 
distribution of lexemes such as ww ‘linen’ (Classical Biblical 
Hebrew) and pia ‘linen’ (Late Biblical Hebrew). Furthermore, 
although the extra-biblical evidence is meagre, it corroborates 
the picture we see within the biblical texts. 

In short, while the vigorous debate in recent decades has 
produced positive improvements in methodology and it has help- 
fully tempered some of the more confident claims of an earlier 
generation of scholarship, it does not seem to have overturned 
the basic consensus: the Old Testament is a collection of texts 
produced over a period of several centuries, and these texts give 


evidence of diachronic linguistic development. 


4.0. Prospects for Further Study and Application 


No such equivalent application of the methods and tools of his- 
torical linguistics currently exists in New Testament studies. New 
Testament Greek, however, is a potential growth area and histori- 
cal linguistics may well provide some tools for better understand- 
ing the Greek verb (especially from its Indo-European vantage 
point). A promising development in this regard is the recent pub- 
lication of The Greek Verb Revisited by editors Steven Runge and 
Christopher Fresch (2016). Within this collection of essays, one 
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sees a new focus on the diachronic dimension as a necessary dia- 
logue partner in the serious current scholarly disputes of the 
nature of the Koine Greek verb (see in particular the essays by 
Rutger Allen, Peter Gentry, and Amalia Moser). 

Historical linguistics as a discipline stands as a welcome 
tool to scholars of all backgrounds engaged in the study of the 
biblical languages (as well as other ancient languages). Even the 
study of a single time period of a language is the study of a lan- 
guage in a state of perpetual (though gradual) diachronic flux 
and transition. This chapter has hopefully demonstrated that the 
tools and methods that historical linguists employ can be profit- 
ably used by biblical scholars as a means of advancing the state 


of our exegetical and historical knowledge of the biblical texts. 
5.0. Further Reading 


5.1. Handbooks, Companions, Etymological 


Dictionaries 


. Greek morphonology: Rix (1992) 

. Greek etymology: Chantraine (2009) 

. Greek verb: Runge and Fresch (2016) 

. Hebrew morphonology: Blau (2010) 

. Hebrew and Aramaic: Noonan (2020) 

. Non-Semitic Loanwords: Noonan (2019) 
. Hebrew vowels: Suchard (2019) 

. Semitic etymology: Kogan (2015) 
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5.2. General Introductions 


. Historical linguistics: Campbell (2021) 
. Indo-European: Fortson (2010) 

. Greek: Horrocks (2014) 

. Semitic: Rubin (2010) 

. Hebrew: Sdenz-Badillos (1993) 

. Aramaic: Gzella (2021) 


na Bb wWNY 


5.3. Foundational Texts 


1. Indo-European: Klein, Joseph, and Fritz (2017) 

2. Wackernagel’s 1918-19 lectures: Langslow (2009) 
3. Historical linguistics: Hock (1991) 

4. Indo-European: Kurytowcz and Mayrhofer (1968) 
5. Semitic: Huehnergard and Pat-El (2019) 
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COMPUTATIONAL LINGUISTIC ANALYSIS 
OF THE BIBLICAL TEXT 


Willem Th. van Peursen 


1.0. History and Development of the Theory 


1.1. Introduction 


The application of computational linguistics to the Bible is part 
of the broader field of ‘Bible and Computer’ as it was coined in 
the 1970s and which encompasses, besides linguistic research, an 
ever-increasing field of computational textual analysis applied to 
the biblical text. It takes place at the intersection of biblical stud- 
ies and the rapidly developing field of Digital Humanities. 

This chapter deals with computational linguistics as a 
method, besides the other methods described in this volume. It 
should be recalled, however, that computational linguistics is 
interwoven with other approaches. When we compare the vari- 
ous available syntactic databases of the Hebrew Bible, we can 
observe, for example, that of the three most well-known data- 
bases the Andersen-Forbes database is explicitly eclectic in its 
linguistic theory (cf. Andersen and Forbes 2012).' On the other 


' For project documentation and bibliographical references, see 
http://andersen-forbes.org, accessed 1 May 2023. 
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hand, the ETCBC database or BHSA (cf. Kingham and Van Peursen 
2018) builds on the form-to-function approach developed by 
Jaap Hoftijzer and Wolfgang Richter (Van Peursen 2007, 140- 
41) and is influenced by the text-syntactic approach developed 
by Harald Weinrich and Wolfgang Schneider (Van Peursen 
2020a, 140-55). Holmstedt’s and Abbegg’s Accordance Hebrew 
Syntax Database (Holmstedt and Cook 2018) is highly informed 
by generative linguistics (Accordance documentation 2014).? 
Because of these observations, it would be an oversimpli- 
fication to treat computational linguistics as a method distinct 
from, for example, generative linguistics or Cognitive Linguistics. 
Each computational linguistic analysis uses a digital corpus and 
each of these corpora is rooted in linguistic theories (see below, 
§2.3). Moreover, computational linguistics is a broad field that 
includes various approaches such as rule-based computer-science, 
statistics, Artificial Intelligence, and Deep Learning.* All these 
approaches have been applied to the biblical languages, and 
hence this chapter will present various approaches rather than 
one single method. Nevertheless, we shall see that these com- 
putational approaches have some common features that justify 
treating them together and that they have developed further in 


ways that are typical for computational corpus linguistics and go 


? For other database projects on Biblical Hebrew that were active over 
the last decades see Kroeze (2013). 


3 In addition, computational linguistics as a discipline also covers 
approaches such as speech recognition or natural language generation 
that fall outside the scope of the current chapter. For an overview see, 
e.g., Clark, Fox, and Lappin (2010); Jurafsky and Martin (2021). 
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beyond the linguistic theories underlying the annotations in the 
respective databases. This reality justifies a separate chapter in 
the current volume devoted to computational linguistics as a 
method by itself. 


1.2. The Beginnings 


From our remarks in the introduction, it will be evident that it is 
impossible to give a historical survey of the application of compu- 
tational linguistics to Biblical Hebrew without considering the 
wider context. This is the context of text and computing as an 
emerging field of studies in the twentieth century. Leaving aside 
for the moment predecessors such as the mechanical machines 
that were made, or at least designed, in the nineteenth century, 
such as the design for a mechanical general-purpose computer by 
Charles Babbage, we will start this survey with the emergence of 
the forerunners of the modern computers in the 1940s and 1950s. 
In this period, we see the transformation of the calculation 
machine into the universal machine: that is, a machine that can 
do any task for which it is programmed. In these early years, 
Robert Busa started his famous project of the Index Thomisticus, 
which involved the complete lemmatisation of the works of 
Thomas Aquinas (which consists of 181 works, comprising 11 
million words). This monumental project started in 1949 and 
lasted about thirty years. 

The first universal computers created were not primarily 
meant for text processing. It should be recalled that the combi- 
nation of text/language and computation/calculation is not as 


self-evident as it now seems. Even long before the emergence of 


226 van Peursen 


computers, the fuzziness and ambiguity of natural language frus- 
trated projects like those of Wilhelm Leibniz (1646-1716), who 
for “his whole life... continued to believe in the construction of 
a language consisting of logical symbols that could be manipu- 
lated by means of a calculator. Such a language, and a machine 
to ‘calculate’ it, would enable any philosophical debate to be 
settled with the click of a button” (Van der Weel 2011, 106). 
Likewise, “around a hundred years ago, polymaths like Bertrand 
Russell were furiously fighting to capture the nuances of lan- 
guage with a view to developing a universal formal language,” 
which remained an ongoing academic pursuit that continued in 
the field of computer science, but appeared to be a highly chal- 
lenging project (see action.ai 2021). 

It was only in the 1960s and the 1970s that the marriage 
between computer and text took place. In the 1960s, computers 
became able to process text. A milestone was the first edition of 
the ASCII standard in 1963. This standard involved a 7-bit encod- 
ing in which, for example, 1000000 stands for @, 1000001 for 
A, and 1000010 for B, and 1000011 for C. In total, the ASCII 
standard contained 128 codes. Accordingly, the first attempts to 
create electronic versions of the Hebrew Bible had to accommo- 
date this standard. 

These attempts started in the 1970s. In 1970, Francis 
Andersen and Dean Forbes started a project that finally resulted 
in the Andersen—Forbes database. In the same year, Christof Felix 
Hardmeier (1970) from Greifswald reported on his own experi- 
ments in his article on the new potential of electronic data 


processing. Somewhat later, in 1977, the Werkgroep Informatica 
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Vrije Universiteit (WIVU) was established in Amsterdam under 
the guidance of Eep Talstra (after whom the WIVU was rebap- 
tised as the Eep Talstra Centre for Bible and Computer [ETCBC] 
in 2013), which marked the start of the WIVU/ETCBC database. 
At Westminster Theological Seminary in Philadelphia, J. Alan 
Groves started pioneering work which initiated the research at 
what is now called the J. Alan Groves Center for Advanced Bibli- 
cal Research. This work resulted in the Westminster Leningrad 
Codex (first released in 1987), to be followed by the Westminster 
Hebrew Morphology (also known as the Groves-Wheeler mor- 
phology) and the Westminster Hebrew Syntax. 

Pioneers such as Andersen, Forbes, Hardmeier, and Talstra 
found each other in the Association Internationale Bible et 
Informatique (AIBI), which was established in 1982 and held its 
first conference in 1985 in Louvain-la-Neuve. Besides the pio- 
neers already mentioned (and others such as Marc Vervenne or 
Emanuel Tov), a driving force behind this organisation was R. F. 
Poswick from the Benedictine monastery of Maredsous. The 
theme of the first AIBI conference was ‘the text’, and that was 
precisely the main challenge during those years: how to represent 
the Hebrew text and linguistic annotations. There was no 
Unicode, no markup language like HTML and XML, and not even 
a PC back then. The first challenge these pioneers faced was 
building electronic text corpora, displaying them on the screen, 


and handling the right-to-left writing direction for Hebrew. 
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1.3. Interface, Office and Network 


Major changes took place in the 1980s and 1990s, which were 
related to such terms and abbreviations as GUI (Graphical 
User Interface), the DTP (Desktop Publishing) revolution, and 
WYSISWYG (What You See Is What You Get). These changes can 
be illustrated by the introduction of the Apple Lisa in 1983, the 
first version of the program PageMaker in 1984, and the first 
release of Microsoft Office in 1989. These developments marked 
a change in the application of the computer towards more office- 
related activities. With this development, the use of the computer 
became much more widespread, both in number of users and in 
types of applications. In the field of biblical studies this resulted 
in the appearance of software packages such as BibleWorks (first 
release in 1992) and Accordance (first release in 1994). 

These new tools became extremely helpful for biblical 
scholars. One could now display the Hebrew Bible and the 
ancient versions side by side, search for words and word com- 
binations in the electronic text instead of consulting a printed 
concordance, and store large commentaries on one’s disk rather 
than on one’s bookshelves.* A side-effect of this development, 
however, was a shift of focus. The early pioneers of ‘Bible and 
Computer’ were mainly concerned with the computer as an ana- 
lytical tool, but in practice, it rather became a useful office tool. 


Being able to search for a word with a query instead of looking 


‘ But often, again, the computer was used to generate concordances that 
were published in print. Thus, e.g., Postma, Talstra, and Vervenne 
(1983); cf. Oosting (2016, 195). 
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it up in a printed concordance may be a little bit faster, but it 
is not a methodological improvement. A burning question that 
occupied the early pioneers but seemed to be hardly a concern 
for the broader community of biblical scholars was: How can we 
go beyond the imitation of the traditional instruments?° 
Another effect of the developments in the 1980s described 
here was that some of the databases that were initiated in the 
1970s and 1980s became commercial products. To my best 
knowledge, it is only the ETCBC database that is publicly avail- 
able,° while the Andersen-Forbes database is only available in 
the commercial Bible software packages of Logos and Accord- 
ance, and the more recent Holmstedt-Abbegg database only in 
Accordance.’ This has hindered further development within the 


scholarly community, because one of the primary conditions of 


° See the telling title of Talstra and Dyk (2006): ‘The Computer and 
Biblical Research. Are there Perspectives beyond the Imitation of 
Classical Instruments?’ 


° https://github.com/ETCBC/bhsa, accessed 4 May 2023. Recently, also 
the MACULA Hebrew syntax trees have become available at https:// 
github.com/Clear-Bible/macula-hebrew, accessed 4 May 2023. These 
syntax trees have been developed by Clear Bible, Inc. together with the 
Groves Cente and build on the Westminster Hebrew Syntax Without 
Morphology and the Open Scriptures Hebrew Bible morphology (serv- 
ing in place of the Westminster Hebrew Morphology). The Groves 
Center has also released the Westminster Hebrew Syntax Without 
Morphology at https://github.com/Clear-Bible/macula-hebrew/tree/ 
main/sources/GrovesCenter, accessed 4 May 2023. 

” Cf. Accordance documentation (2014), for the advantages that 
Holmstedt and Abbegg considered for integrating their database into 
the Accordance software right from the start. 
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computational linguistics is that the analyses are retrievable and 
that the underlying algorithms are available on online platforms 
such as GitHub (cf. below, §4.3). 

In the 1990s, a new element radically changed the digital 
landscape: networking. The World Wide Web was launched in 
1991 and in the same year the first version of Unicode was 
released. These two milestones were closely related, because only 
with the unequivocal definition of characters in Unicode was it 
possible to exchange text that remained stable regardless of the 
environment in which it was read. For PC users, the internet 
became accessible through the browsers that came onto the mar- 
ket, such as Netscape Navigator in 1994 and Internet Explorer in 
1995. 

This new development was also soon picked up by biblical 
scholars. Electronic journals in the field of biblical studies were 
initiated, such as TC: A Journal of Biblical Textual Criticism and 
the Journal of Hebrew Scriptures, which both started in 1996.° 

At the turn of the century, a new stage started with the 
introduction of more interactive forms of publication and com- 
munication, in which the users became both consumers and 
contributors and in which the dividing line between information 
consumption and information creation was blurred. This is often 
labelled ‘Web 2.0’. Milestones include the launch of Wikipedia 
(2001) and the emergence of social media such as Facebook 
(2004) and Twitter (2006). 


8 Mention should also be made here of Hugoye, a journal in the field of 
Syriac studies, which started in 1998. 
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This ever-growing field of textual and social computer 
applications affected biblical studies. The use of electronic tools 
was no longer the privilege of biblical scholars. More and more, 
everyone had an increasing number of online Bibles and Bible 
study tools at their disposal. Likewise, the field of ‘Bible and Com- 
puter’, as defined by the AIBI, was expanding as well. At the sixth 
AIBI conference held in Stellenbosch in 2000, there were sections 
on grammar, statistics, and discourse, but also on education, 
multi-media, publishing, and community, all in relation to the 


Bible and the computer. 


1.4. Reorientation: Methodological Innovation? 


The development described above was not the programme that 
the pioneers of the 1970s and 1980s had in mind when they 
started their work. A re-orientation took place in the first decade 
of the twenty-first century. The seventh AIBI conference (con- 
vened by Marc Vervenne, Leuven, 2004) and the eighth confer- 
ence (convened by Luis Vegas Montaner, Madrid, 2008) were 
both presented as expert meetings focusing on the question of 
how the computer can play an innovative role in biblical schol- 
arship.? How could the computer be used as an analytical tool, 
rather than merely as a library, an office tool, and an imitation 
of traditional tools, which it apparently had become in the 
1990s? 

The question regarding the role of the computer in meth- 


odological innovation touched upon the more encompassing 


? See the overview given in Poswick (2010), but note that the Leuven 
2004 conference is absent from Poswick’s overview. 
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question regarding textual scholarship as a humanities discipline 
in relation to computer science, which typically belongs to the 
sciences. Computation and the related scientific mode of inquiry 
gave the ability to sort, quantify, reproduce, and report text, but 
how could this be fruitfully combined with interpretation as the 
valued mode of assigning or discovering meaning as understood 
in traditional scholarship and the related reflexive concepts of 
individualism and subjectivity (Van Peursen 2010)? 

The final decades of the twentieth century had witnessed a 
shift in the humanities from the hermeneutic and critical tradi- 
tion of the nineteenth and twentieth centuries towards the identi- 
fication and representation of patterns by digital means in the 
second half of the twentieth and the early years of the twenty- 
first century. Rens Bod (2013) coined the two phases ‘Humanities 
1.0’ and ‘Humanities 2.0’. (Note that ‘2.0’ is used here differently 
to in ‘Web 2.0’ discussed above). Humanities 1.0 embodied the 
traditional understanding of the humanities as it was framed at 
the end of the nineteenth century. Wilhelm Dilthey (1833-1911) 
and others advocated a clear-cut distinction between the humani- 
ties and the sciences, the first mainly involved in Verstehen 
(understanding) the second in Erkldren (explanation). This dis- 
tinction had a significant impact on modes of scholarship, but 
also on the organisation of academia, where most institutions 
have separate departments for the humanities and the sciences. 
With the appearance of the computer as a tool for textual schol- 
arship (Humanities 2.0), this distinction was blurred. How could 
this distinction be maintained now that computer scientists 


seemed to be analysing texts in the same way in which natural 
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scientists analysed DNA structures? Rens Bod (2013, 177) has 
argued that this new mode of scholarship should not be the end 
point, but that a next step should be taken (which he labelled 
Humanities 3.0), in which Humanities 1.0 and Humanities 2.0 
are combined and in which the hermeneutic and critical tradition 
of Humanities 1.0 should be applied to the tools and patterns 
obtained by Humanities 2.0. 


1.5. From Rule-Based Analysis to Machine Learning 


While biblical scholars and textual scholars in general were busy 
incorporating computer science into their disciplines, computer 
science itself developed further with astonishing speed. Let us 
illustrate this with the example of machine translation. From the 
early days of computational linguistics, it was evident that it 
would be tremendously useful if the computer could be used to 
translate a text from one language into another. As early as the 
1970s, attempts were made to achieve this task by rule-based 
machine translation. In this approach, the input that the com- 
puter receives is the text to be translated and language rules. 
These rules include, for example, a bilingual lexicon, mor- 
phology, and syntax. The more refined those rules, the fewer 
errors the translation contains and the better it becomes. How- 
ever, after decades of improvements, the results did not meet the 
high expectations. Natural language appeared more unruly than 
people thought (cf. above, 81.1). The rule-based techniques of 
the 1970s to the 1990s were replaced by statistical approaches in 
the 1990s until the 2010s. However, although there was sig- 


nificant progress, the real breakthrough came only with the 
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application of machine learning. Here the input is no longer a 
text to be translated and a set of rules to carry out this task, but 
rather a large collection of training data, in this case of parallel 
texts in two languages, from which the computer itself can learn 
how to translate. Although the mechanisms that are at work are 
largely hidden, the performance is outstanding. 

If we define machine learning more precisely, we can say 
that it is the ability to learn without being explicitly pro- 
grammed. It is a subgroup of Artificial Intelligence, which refers 
to any technique that enables computers to mimic human behav- 
iour or, more precisely, the effort to automate intellectual tasks 
normally performed by humans. Artificial Intelligence (AI) went 
through various stages by itself, from symbolic AI, which was 
prevalent until the 1980s and involved the application of explicit 
rules and the manipulation of logic, to machine learning, where 
the computer goes beyond the instruction and rules it is given 
and learns by itself how to perform a certain task. A subgroup of 
machine learning is deep learning, which refers to the extraction 
of patterns from data with the help of neural networks. In the 
case of machine learning, we can distinguish between supervised 
machine learning, in which the machine learns from human- 
labelled examples, and unsupervised learning, in which the 
machine has to detect patterns in the unlabelled data by itself. 
Supervised methods include attempts for text classification. In 
biblical studies, an example is Dicta (see n. 10), which provides 
an exciting collection of tools for author recognition and text 


classification, such as the Tiberias Stylistic Classifier (cf. below, 
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§§2.5 and 3.3).'° These tools can be used, for example, to classify 
a text of debated origin along the lines of early and late Biblical 
Hebrew. 

Experts may challenge the rather simple definitions given 
here for machine learning, deep learning, and Artificial Intel- 
ligence, and there is much debate about the exact nature and 
definitions of the various designations given here. But the main 
point to be made is that a major shift has taken place, which 
affects our whole understanding of using the computer in linguis- 
tic analysis. Elsewhere I have suggested that the transformations 
that are taking place now with the transition from rule-based 
approaches to machine learning may even mark a more drastic 
discontinuity with existing methods of biblical interpretation 
than the appearance of the computer as an exegetical tool in the 
last decades of the twentieth century (Van Peursen 2020b, 310). 
Whereas many of the rule-based approaches could somehow 
mimic traditional approaches (e.g., queries replacing concord- 
ances or manually created lists), machine learning opens up 
completely new avenues of scholarship that may lead to new 
forms of human-computer interaction in the interpretation of 


texts. 


1.6. Corpora and Fuzzy Data 


The application of computational linguistics to the Bible implied 
that the Bible was considered a corpus, and thus it entered the 


field of corpus linguistics. However, in this field of studies, the 


1° See https://tiberias.dicta.org.il/, accessed 4 May 2023. 
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Hebrew Bible did not match the corpora from other languages 
and periods. There are huge differences between, for example, 
the British National Corpus (BNC) and the Hebrew Bible. The 
BNC has more than 100 million words, as against 420,000 words 
in the Hebrew Bible. The BNC has extensive metadata about, for 
example, author and date of origin, whereas for almost every part 
of the Hebrew Bible authors and provenance are debated issues. 
The BNC has been carefully selected to create a representative 
linguistic corpus, whereas the Hebrew Bible, whatever the selec- 
tion processes that made it the biblical canon that we now have, 
was never intended to be linguistically representative and was 
selected along completely different criteria. Accordingly, at least 
until the turn of the century, the computational analysis of the 
Hebrew Bible was a questionable undertaking according to the 
developing standards of computational linguists. When in the 
early 1990s Eep Talstra once presented his research on Deuteron- 
omy to an audience of computational linguists, he met with much 
misunderstanding. How could he study a corpus of which he did 
not know the date of origin? Unaware of the complex questions 
regarding sources and editorial processes in the Hebrew Bible 
that have puzzled biblical scholars for centuries, one of the 
respondents suggested that Talstra should first clean up his data 
(that is: stripping it of any later additions so that what remains is 
a corpus of which the date and provenance are clear) before any 
linguistic or textual analysis could start (Talstra 2010, 54). 

This situation has completely changed since the above- 
mentioned emergence of Web 2.0. Currently, much linguistic 


research is conducted on tweets, blogposts, and other digitally 
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born texts that represent a kind of fuzzy data, with little context 
and little metadata, all of which resembles the Hebrew Bible 
much more than the BNC does. Thanks to these developments, 
computational research into the Bible has found a better con- 
nection with the wider field of Digital Humanities than in the last 


three to four decades of the twentieth century. 


2.0. Key Theoretical Commitments and Major 


Concepts 


2.1. Solid Criteria instead of Subjective Intuition 


Biblical scholars were among the first who experimented using 
the computer as a tool for the study of texts and languages. In the 
1970s, at the dawn of computer-aided textual analysis and more 
specifically in the newly emerging field of ‘Bible and Computer’, 
the mission of the pioneers (above, 81.2.) was clear: Make mean- 
ingful, substantiated statements about the Bible. Such an effort 
did not guarantee the correct interpretation, but at least it could 
identify interpretations that did not match the facts. Traceability 


and transparency played an important role in this development." 


'' A typical example is Hardmeier’s above-mentioned article. Addres- 
sing the question as to whether the computer can help in traditional 
source criticsm, Hardmeier (1970, 180) argues: “Die maschinelle 
Konkordanzarbeit erméglicht dagegen ein Zweifaches: Einmal kann der 
Kriterienkatalog iiber die Wortschatzstatistik hinaus auf neue, rein 
formale Struktureigentiimlichkeigen bestimmter Texte ausgedehnt werden 
(...) Zum anderen kan tiberpriift werden, wieweit lexikalische und 
formale Merkmale fiir bestimmte Textschichten iiberhaupt charak- 
teristisch sind.” It was, however, only after the emergence of machine 
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This mission fit in a positivist, modernist climate, and was 
especially opposed to an unbridled theologising based on indi- 
vidual words and etymologies, which has made, among others, 
Kittel’s Theologisches Worterbuch zum Neuen Testament (1942- 
1979) famous (or infamous). In the 1960s, James Barr assessed 
this approach critically on the basis of general linguistic and 
philological insights (see Barr 1961). Talstra, who earlier in the 
1970s had studied with Barr in Manchester, would certainly 
agree with Barr’s criticism, and started creating a database that 
was not focused on etymology and semantics, but on syntax. 

This mission of those pioneers in Digital Humanities (at 
that time other labels were used such as Alpha-Informatics) still 
plays an important role and is reflected in open science practices. 
Statements about occurrences of words or patterns can be made 
traceable and reproducible, for example by publishing queries on 
the SHEBANQ website of the ETCBC."* At the same time, we have 
also seen developments in computational textual analysis that 


run counter to the ideals of the pioneers (above, 81.3). 


2.2. Deep Blue and AlphaGo 


The introduction of the computer in the workplace of the exegete 
in the last decades of the twentieth century enabled the biblical 
scholar to be more systematic, objective, and quantitative. The 


qualification ‘objective’ does not mean that computational data 


learning that the computer was used fruitfully to address the traditional 
source-critical questions; cf. below, 83.3. 

” See https://shebanq.ancient-data.org, accessed 4 May 2023. For the 
underlying ideas and the role of annotation, see Roorda (2018). 
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is theory-neutral or that the computer provides the final answer 
(as it is sometimes misunderstood), but it does indicate another 
mode of scholarship. The added value of this work lies in its 
systematic approach, which reduces the role of intuitive ad hoc 
interpretations, on the one hand, and in the increasing complex- 
ity of searches and analyses, on the other. 

In the 1970s—2000s, the claim that the computer was more 
systematic and objective was frequently met with scepticism by 
traditional scholars, who often found those computer guys weird 
and imagined that using the computer as a tool in biblical studies 
was in fact building an echoing well: what you get out of the 
computer depends on what you put into it. Often the question 
was raised: What does the computer deliver that could not be 
delivered without it? To parry this criticism, I often used the 
analogy of the chess computer, which seemed especially apt since 
Deep Blue had beaten Gary Kasparov in 1997. The mind of the 
human chess player works efficiently because it recognises pat- 
terns and therefore has a useful selection mechanism, whereas 
the computer, so to speak, calculates everything (still an interest- 
ing study is De Groot 1946). However, the speed with which the 
computer does so (which has increased over the years) is so 
immense that it surpasses human capacities. Likewise, once you 
have an annotated database, questions that would take months 
or years when addressed manually—e.g., a statistical overview of 
plene or defective spellings, or a collection of all clause patterns 
in the Hebrew Bible where the object precedes the verb (for 
examples, see §3 below)—can now be answered in very short 


time periods. 
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Whereas the analogy of the chess computer worked well 
until the first years of the twenty-first century, it does not capture 
the developments that have taken place over the last two dec- 
ades, described above (81.5), such as the emergence of machine 
learning. For these developments, we can rather use the analogy 
of the Go computer (see Dorobantu 2022). Whereas the human 
world champion of chess was defeated by the computer in 1997, 
it took until 2016 before a computer program defeated the 
human world champion in Go. The difference between chess and 
Go is that with Go the complexity of the game and the number 
of possible continuations is exponentially higher than with chess. 
Moreover, in Go there is also a strong aesthetic aspect. Because 
of the infinite number of possibilities and the aesthetic aspects, 
computational calculation power is not enough to win the game. 
Rule-based approaches (enabled by calculation power) were 
insufficient, but learning capabilities (enabled by pattern recog- 
nition) succeeded. Likewise, in textual analysis the computer is 
no longer merely a powerful calculation or sorting machine. It 
has become a much more complex instrument, and to some 


extent less dependent on human input (above, §1.5). 


2.3. The Role of Linguistic Theory in the Creation of 


Text Databases 


The different endeavours to create linguistic databases of the 
Hebrew Bible reflected the different approaches that were and 
are current in biblical linguistics. Each approach has its advan- 
tages and disadvantages. And even in those cases where the 


builders of a database try to be as theory-neutral as possible, it 
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will be evident that any database and any choice that is made is 
informed by one’s position about Biblical Hebrew and about lan- 
guage in general. This is not only a challenge for computational 
approaches. It is the case in any linguistic or textual study of the 
Hebrew Bible or Greek New Testament. 

Let us have a look how linguistic theory functions in the 
three most well-known databases (above, §1.1; see also Miller- 
Naudé and Naudé 2018, 7), which we will discuss in the order of 
their age. First, the Andersen-Forbes database is eclectic and 
hence somewhat ambiguous in its relation to generative gram- 
mar. Andersen and Forbes explicitly reject Chomskyan linguistics 
but also “find much of value in the work of the generativists, 
especially generalized phrase structure grammar” (Andersen and 
Forbes 2012, 14; see also Van Peursen 2015, 301). One of the 
main reasons for their rejection of Chomskyan linguistics is their 
claim that Biblical Hebrew belongs to the non-configurational 
languages, which are “a serious impediment to the transforma- 
tionalists’ quest for Universal Grammar” (Andersen and Forbes 
2012, 87).'° Andersen was also influenced by structuralism and 
by Kenneth Pike’s tagmemics (Miller-Naudé and Naudé 2018, 7). 
Informed by syntax, function, and semantics, they developed a 
rich set of annotation labels, including, among others, seventy- 
six part-of-speech labels. 

Second, the ETCBC tried to be more independent of linguis- 


tic theory by following the principles of distributional analysis, 


'3 For a different view on the question as to whether Biblical Hebrew is 
a configurational language, partly in response to Andersen and Forbes, 
see Kaajan (2019). 
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form-to-function, and bottom-up. Unlike the Andersen—Forbes 
database, the ETCBC database has a rather minimal parts-of- 
speech set (Kingham 2018). And unlike the other databases, the 
ETCBC database does not immediately assign functions to forms, 
but starts with a distributional analysis of linguistic phenomena 
(at all levels, for example varying from morphemes to clause 
patterns) before functional labels are assigned. Moreover, the 
deduction of functions from formal criteria is transparently and 
traceably documented in auxiliary files that are part of the data 
creation workflow (Kingham 2018; Kingham and Van Peursen 
2018). 

The ‘bottom-up’ description is used for approaches that 
start from the identification of morphemes and word level analy- 
sis and move from there to the higher levels of phrases, clauses, 
sentences, and text-syntactic relations. It is often combined with 
the form-to-function principle, which holds that we should first 
make a distributional analysis of forms and patterns before any 
function can be deduced. It starts from the awareness that we 
know little about the biblical languages and that to avoid cre- 
ating an echoing well out of our own analysis or database, we 
should start with observable textual phenomena before we pro- 
ceed to function or even semantics. 

‘Bottom-up’ is often contrasted with ‘top-down’. In Biblical 
Hebrew linguistics, the latter is represented, for example, in the 
textlinguistic approach of Robert Longacre (1989), which is 
much more informed by cross-linguistic evidence and applies 
categories known from other languages (such as narrative, 


predictive, hortatory, or expository genres; techniques for, for 
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example, distinguishing between mainline and offline informa- 
tion or for indicating the peak of a text or discourse). Likewise, 
the distributional analysis of forms and patterns, which to some 
extent is like Construction Grammar or exemplar-based syntax as 
it developed in the 1980s and 1990s, is often considered as a 
counter-reaction to the generative linguistic framework. 

Third, the Holmstedt-Abbegg database, also called the 
Accordance Hebrew Syntactic Database, is based on a generative 
framework (cf. Accordance documentation 2014). Whereas the 
Andersen—Forbes and ETCBC databases started in the 1970s, the 
Holmstedt-Abbegg database started more recently, in 2008. 
Their intention was to create a database upon a model of Hebrew 
syntax that differed from the two existing databases, with “a tight 
focus on syntax, grounded in (but not bound by) Chomskyan 
generative linguistic theory” (Holmstedt and Cook 2018, 2). 
More specifically, they adhered to Chomskyan minimalism, 
which was developed in the 1990s from the Government-and- 
Binding model that was prevalent in the 1980s, but they also 
realised that “to base the database and its underlying tagging 
scheme on a fully articulated minimalist framework would be 
inappropriate.” For this reason, they combined their adherence 
to Chomskyan theory with the motto “data primary, theory wise” 
(Holmstedt and Cook 2018, 3). 

That the Holmstedt-Abbegg database is grounded in 
Chomsky’s generative approach is visible, among other things, 


in the inclusion of so-called null constituents. Because of the 


For other databases, which are not yet available publicly, such as 
Richter’s database, see Kroeze (2013). 
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generative principle that every phrase has a ‘head’, a null marker 
has been inserted in every phrase that lacks an overt head. That 
Holmstedt et al. were not bound by the generative approach is 
visible, for example, in their non-binary hierarchical clause 
analysis, thus differing from Chomsky’s minimalist syntax (as 
well as the Government-and-Binding model), which adopts a 
strictly binary approach to constituent structure (Holmstedt and 
Cook 2018, 10). 


2.4. Back to the Black Box? 


The emergence of author recognition techniques, neural net- 
works, Artificial Intelligence, and machine learning in recent 
years provided new potential for biblical studies, but it also posed 
new challenges. The results are astonishing, but exactly what the 
algorithms do takes place in an impenetrable ‘black box’. (The 
reality of machine learning is that the computer pieces together 
a set of patterns increasingly sophisticated until they fit the 
starter data, and then these patterns are used to interpret new 
texts. There is no known way to describe or articulate these pat- 
terns, however, which is why machine learning algorithms are 
spoken of as a ‘black box’.) This seems to be a development that 
is the reverse of the openness and traceability that the ‘Bible and 
Computer’ pioneers stood for. The attempts to make Artificial 


Intelligence understandable to humans in the field of ‘explainable 
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Al’ (for example, in the DIANNA project [Deep Insight in Neural 
Network Analysis]) is still in its infancy.’® 

These transformations are perhaps even more drastic than 
those of the 1970s. That early period from the 1970s showed, to 
some extent, a continuation of pre-digital scholarly practices. For 
example, it became possible to look up words with a search query 
in a digital text file instead of a paper concordance, but that did 
not imply any methodological innovation. 

In recent years, there have been various attempts to inte- 
grate these new developments into biblical studies by making 
use of advanced statistical analysis (Naaijer 2020; cf. below, 
§3.2), topic modelling (Vlaardingerbroek 2017), Markov Chains 
(Kingham et al. 2018), stylometrics (Van Hecke 2018; Van Hecke 
and De Joode 2021), and neural networks (Van der Schans et al. 
2020; Naaijer 2020, 149-75). Here the main challenge is to 
determine how the results of the ‘black box’ relate to current 
scholarship. '° 

A case in point are the projects in which text clustering 
methods are applied to questions related to linguistic dating to 


see whether we can distinguish certain groups or collections of 


1S See ‘Deep Insight And Neural Network Analysis—DIANNA’, 
https://www.esciencecenter.nl/projects/deep-insight-and-neural- 
networks-analysis-dianna/, accessed 25 May 2023. 


© Most examples in this section are taken from the ETCBC, because the 
present author is most acquainted with it, but the situation with other 
institutions and with individual researchers seems to be similar. Schol- 
ars recognise the great potential of recent developments in computer 
science but are still in an experimenting phase to find out how it can be 
made useful to biblical studies. 
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texts that agree with current scholarly notions such as Standard 
Biblical Hebrew versus Late Biblical Hebrew. The challenge is, if 
the outcomes agree with current scholarship, the computational 
analysis does not really add to our knowledge, except for con- 
firming existing theories. But if the outcome seems to be at odds 
with current scholarship, should we search for explanations that 
still fit the traditional framework (e.g., labelling outliers in an 
alleged early corpus as later additions), or should we rather chal- 
lenge and tweak the algorithms? And if we improve the algo- 
rithms so that they better yield the expected results, how do we 
avoid the risk of creating a circular argument??” 

For all the layers of linguistic analysis that were explored 
with rule-based approaches and distributional analysis from the 
1970s onwards, these new approaches have the potential to 
accelerate, refine, or automate analytical procedures. Although 
there are now various databases containing a morphological 
analysis of the Hebrew Bible, when extending the corpus to other 
Hebrew texts or corpora of other Northwest-Semitic languages, 
machine learning can be used to accelerate the process of the 
morphological analysis.'* Likewise, with the search for phrase 


patterns, new methods searching for patterns using n-grams, flex 


7 Cf. below, §3.3, for the example of the distinction between P and non- 
P by author-clustering algorithms. 


'8 Thus, the eScience Center project ‘Morphological Parser for Inflec- 
tional Languages Using Deep Learning’ aims to accelerate the analytical 
procedures by having the computer make more accurate predictions 
about the morphological analysis based on the ETCBC’s existing 
Hebrew- and Syriac-encoded texts (Naaijer and Van Peursen 2022). 


Computational Linguistic Analysis 247 


grams, etc. can replace the pattern-matching tools that func- 
tioned in the distributional analysis with which the ETCBC 
started (or the manual assignment of phrase patterns based on 
human intuition in other projects).!° In the text-syntactic analysis, 
automatic anaphora resolution can complement existing methods 
of computer-assisted, text-hierarchical analysis based on clause 
relations (cf. Erwich 2021). The identification of participants is 
the first step to establishing their relationships as the basis for 
social network analysis, and other emerging approaches in which 
computational linguistics and literary analysis meet (cf. Canu 
Hgjgaard 2021). 


2.5. From Talstra to Tiberias 


The projects and experiments described in §2.4 show a difference 
from the various approaches in the early years of ‘Bible and Com- 
puter’. In the projects of Andersen, Forbes, and Talstra, the lucid- 
ity of the rules that were applied served as an argument for the 
validity of the analysis. In those new approaches, the proof for 
the validity is not so much the structure of the algorithms or the 
analytical steps, but rather the results of test cases.”° In, for exam- 
ple, the author-clustering tools that are used for Tiberias (above, 


§1.5), what counts as convincing argument for the analysis is the 


‘9 This happens in the CLARIAH Fellowship project ‘PaTraCoSy: 
PAtterns in TRAnslation: Using COlibriCore for the Hebrew Bible corpus 
and its SYriac translation’ (Coeckelbergs 2022). 

20 In the case of the Tiberias Stylistic Classifier (see the following 
discussion), at the moment of this writing the algorithms used are not 
publicly available. 
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results of a test set.?! In their case they point to the successful 
deconstruction of an artificially mixed book, consisting of ran- 
domly merged segments from Jeremiah and Ezekiel, coined ‘Jer- 
iel’. The algorithms successfully distinguished between the two 
components of this artificial book with an accuracy of 89 to 95 
percent (Dershowitz et al. 2015). 

In conclusion, the potential of the machine learning algo- 
rithms is unprecedented, and the results are impressive. How- 
ever, what exactly those algorithms do, and how they arrive at 
their results, is beyond human understanding. The insightful and 
traceable analyses that were the showpiece of emerging computa- 
tional Bible research (above, §2.1) are now giving way to a black 
box that, while yielding great results, allows little insight into 
what goes on inside that box. Even if the output of the algorithms 
provides some insights (e.g., the Tiberias programs list the phe- 
nomena on which the results are based, such as typical linguistic 
elements of a selected corpus, which distinguish it from another 
corpus), the human researchers will have to find out by them- 
selves the typical linguistic or stylistic features of a certain corpus 


or collection (cf. below, §3.3). 


2! The term ‘author clustering’ is in this context more precise than 
‘author recognition’; cf. Dershowitz et al. (2015, 255). 
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3.0. Use and Contributions in Biblical Studies to 


Date 


3.1. Orthography 


After the emergence of databases of the Hebrew Bible, it soon 
became clear that computational analysis enabled types of 
research that were hardly imaginable without digital tools. As 
early as the 1980s, Francis Andersen and Dean Forbes (1986) 
published their monumental work on spelling in the Hebrew 
Bible, filled with tables and mathematical formulas to investigate 
the distribution of matres lectionis over the entire biblical corpus. 
They could make observations about the extent to which these 
vowel letters were used in the biblical corpus, about the relation 
of the Masoretic Text to the more defective pre-exilic inscriptions 
and the more plene spellings of the last centuries BC, and about 
differences between the various parts of the Hebrew Bible, with 
the Pentateuch having the most conservative spelling. More 
recently, Johan de Joode and Dirk Speelman (2020) have applied 
quantitative linguistic methods to the orthographic heterogeneity 
within the Hebrew Bible and the Dead Sea Scrolls. 


3.2. Syntax 


When syntactic databases became available, all kinds of research 
questions could be addressed more effectively, ranging from 
major questions about diachronic developments (Siebesma- 
Mannens 2014), to the extent to which poetic structure affects 
clause patterns (Bosman 2019), to corresponding phrase and 


clause patterns in the Hebrew Bible and the Peshitta (Van 
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Peursen 2007; Dyk and Van Keulen 2013), and to the interpre- 
tation of specific grammatical phenomena or translation issues. 
It is now easy to find parallels for the construction in the phrase 
NINO DPA OPAX ya pqa 727 Wwm ‘On that day Deborah and 
Barak son of Abinoam sang (f. sg.)’ (Judg. 5.1), where the verb 
preceding the compound subject agrees with the first element of 
this subject. Those parallels show that this is a common phe- 
nomenon and that an emendation of the verb or the deletion of 
the second part of the subject (‘and Barak...’) is not needed 
(Sandborg-Petersen 2011; Meeuse 2021, 10). Likewise, a careful 
analysis of the verb valence pattern used shows that the phrase 
nix pp? mm ow (Gen. 4.15) should be translated ‘And the LORD 
set a sign in place on behalf of Cain’ rather than with ‘And the 
LORD put a mark on Cain’, which is the rendering of the NRSV 
and many other translations (Dyk, Glanz, and Oosting 2013, 30- 
32; Meeuse 2021, 6-7). 

The more advanced applications of statistical analysis and 
machine learning enable new possibilities for charting the dis- 
tribution of clause patterns over the Hebrew Bible in relation to 
various parameters, such as assumed date of origin, genre, text 
type, and sentence pattern. An interesting case concerns the dis- 
tribution of ‘to be’ constructions. In Biblical Hebrew, there are 
five ways in which ‘to be’ can be expressed: Bipartite and tripar- 
tite nominal clauses; constructions with the particles vw (‘there 
is’) and px (‘there is not’); and clauses containing the verb 777 (‘to 
be’). On the basis of quantitative analysis taking all these parame- 
ters into account, Martijn Naaijer (2020) has convincingly argued 


that in the alleged Early Biblical Hebrew corpus the so-called 
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narrative text type and the direct speech sections differ consid- 
erably, and that the direct speech sections show similarities with 
the Late Biblical Hebrew texts (regardless of the distinction 
between narrative and direct speech in the latter). In other 
words: in late texts, there is less of a difference between narrative 
and direct speech. 


3.3. Author Clustering 


Perhaps the most cutting-edge application of machine learning 
and computational linguistics to biblical studies can be found at 
the research group at Bar Ilan University that is responsible for 
the Tiberias Stylistic Classifier for the Hebrew Bible (above, 
§§1.5., 2.4., and 2.5.). Their tools distinguished between Priestly 
(P) and non-Priestly (non-P) texts in the Pentateuch, thus agree- 
ing with a major conclusion of the Documentary Hypothesis 
(Dershowitz et al. 2015).”* This is not only a milestone in the 
application of computational linguistics in biblical studies. It also 
shows where the interaction between computational linguistics 
and biblical scholarship can now take place, because the outcome 
of the computational analysis is not merely an ‘objective proof 
of a scholarly hypothesis, but rather the start of new scholarly 
reflection as articulated and tested with the iterative develop- 
ment and application of computer algorithms. Questions that 
arise are: How can we account for the few verses that have been 


classified as non-P in traditional scholarship, but were assigned 


?? Another interesting case study is the assignment of Isaiah 34-35, for 
which it has been argued that these chapters were written by Deutero- 
Isaiah (Berman 2021). 
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P in the computational analysis and vice versa? Do they reveal 
flaws in the algorithms, or should we rather reconsider their 
assignment to P or non-P? (For this dilemma see also above, 
§2.4.) What does the outcome tell about the other elements of 
the Documentary Hypothesis, such as the J, E, and D sources, that 
cannot be distinguished by the algorithms?*? Would scholars ever 
have set out to answer such a question with computational lin- 
guistics if the hypothesis had not already existed? The algo- 
rithm’s ability to distinguish between P and non-P means they 
consistently differ, but what confidence do we have that they 
differ in the way scholars have claimed they do (e.g. in terms of 
authorship and date)? 

Another question relates to the notion of author recognition 
and computer programs that are built to detect unconscious indi- 
vidual elements of language use and an author’s “subtle stylistic 
preferences” (Dershowitz et al. 2015, 253). How can such an 
approach be applied to compositions such as P that in Old Testa- 
ment scholarship are usually considered the work of groups of 
scribes, or as consisting of successive editorial layers?”* The algo- 
rithms will not reproduce S. R. Driver’s (1913, 131-35) list of 
words and phrases typical of P, and the notion of an author as an 


individual that can be identified on the basis of unconscious 


3 Cf. Dershowitz et al. (2015, 270): “There appear to be two possible 
explanations for this: (1) the J and E source are not sufficiently distinct 
from one another in terms of word usage (...); (2) the traditional J/E 
division is flawed.” 


4 See, e.g., Smend (1978, 57), on the supposed successive stages of the 
composition of P. 
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authorial fingerprints seems to be remote from the priestly circles 
like the alleged Sitz im Leben of P in traditional Old Testament 
scholarship. Yet, traditional source criticism and cutting-edge 
author-clustering algorithms largely arrive at a similar distinc- 
tion between P and non-P. Here is both the requirement and 
opportunity to reconcile the claims of what has been called 
‘algorithmic criticism’ (Verhaar 2016) with those of traditional 
scholarship. Or in other words, to proceed from Humanities 1.0 
(traditional source criticism) through Humanities 2.0 (source 
detection with author-clustering algorithms) to Humanities 3.0 
(cf. above, §1.4). 


4.0. Prospects for Further Study, Application, and 


Collaboration 


4.1. Syntax and Semantics 


Most of the database projects that began in the 1970s and the 
1980s started with syntax. The Andersen-Forbes database also 
includes semantic roles, but the way in which the labels have 
been assigned is not always clear and hence they are difficult to 
reproduce (and therefore assess). The other databases currently 
available also have a strong focus on syntax. This focus is under- 
standable from the positivist climate in which these projects orig- 
inated and the uneasiness that was felt with contemporaneous 
etymologising lexicographical approaches. But now, about half a 
century later, it is crucial to investigate how computational lin- 
guistics can be applied to the semantics of Biblical Hebrew. Oth- 


erwise, what happens is that advanced syntactic databases are 
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enriched with digital representatives of the scholarly knowledge 
of the nineteenth and twentieth centuries as it is codified, for 
example, in Brown, Driver, and Briggs’s 1910 lexicon. This is 
what we see happen in the commercial or semi-commercial Bible 
software packages in which both the advanced syntactic data- 
bases discussed in this chapter and the older lexicographical 
resources have become available. 

There are two clear ways in which the current syntactic 
databases could be extended towards semantic analysis. The first 
relates to the intersection of syntax and semantics. The search for 
valence patterns provides new insights about the meaning and 
usage of a verb. Hence one way to proceed is to enrich the syn- 
tactic labels with verbal valence patterns and the associated 
semantic roles according to strict criteria of how valence patterns 
and meaning interrelate (cf. Dyk 2016). For the study of verbal 
valence and clause patterns, the application of existing approach- 
es, especially those that have been applied successfully in compu- 
tational linguistics such as Role and Reference Grammar, appears 
to be promising (cf. Canu Hgjgaard 2019). 

Another way in which the current database could be 
extended to semantic analysis is the application of methods that 
have been developed in computational lexicography and seman- 
tics to the Hebrew Bible. Obviously, not everything that has been 
developed in this field is applicable to the Bible, which is, lin- 
guistically speaking, a limited corpus without native speakers. 
Thus, building a WordNet for Biblical Hebrew would meet with 
many complications. What could be promising, however, is to 


experiment with approaches such as co-occurrence analysis, 
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topic modelling, and similar methods to establish the relations 


between words.” 


4.2. Linked Data and Geospatial Analysis 


An extension of semantic and lexicographic information may be 
the interlinking with other resources. In recent years, the ETCBC 
has explored the potential use of Linked Data in which textual 
data is linked to encyclopedic or geospatial data. Pilot projects 
include Linking Syriac Data (2017-2018);” Linking Syriac Litur- 
gies (Van Peursen and Veldman 2018); and Linking Syriac Geo- 
graphic Data (see Van Peursen 2018). Although it is wonderful 
that this brings the textual data (in these projects: Syriac data) 
into the Linked Data universe, there is the danger that encyclo- 
pedic information takes the place of sound syntactic analysis. If, 
for example, we want to map all the geographical entities men- 
tioned in the Syriac Book of the Laws of the Countries, we have 
to decide how to identify the places mentioned or to locate the 
peoples mentioned in those texts. The same can be said of the 
famous catalogue of nations and peoples gathered at Jerusalem 
in Acts 2:9-11 (Van Altena 2022, 135-57). Such questions may 


?5 Such new initiatives could be linked with or even incorporated in the 
most up-to-date digitally-available lexicographical and semantic resources, 
such as those of the Semantic Dictionary of Biblical Hebrew (https:// 
semanticdictionary.org, accessed 4 May 2023) and the Semantics of 
Ancient Hebrew Database (https://www.sahd.div.ed.ac.uk, accessed 4 
May 2023) projects. 


© See https://github.com/hvlaardingerbroek/LinkSyr, accessed 4 May 
2023. 
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be even more challenging in the case of biblical studies, because 
of the debate over the extent to which the biblical accounts can 
be related to the history and geography of ancient Israel and 
given the uncertainties about the identification of places and 
events in the Bible. 

In addition to these interpretive difficulties, there is the 
challenge that we mentioned in §1.3 above. If linked geospatial 
data do not go beyond a mere digital representation of the 
well-known traditional atlases of the Bible, or of the maps of 
Jerusalem, ancient Israel, the ancient Near East, and the Roman 
Empire often included in printed Bibles, this only serves practical 
purposes, rather than representing a methodological innovation. 
However, given the ‘spatial turn’ in biblical studies (cf. Van 
Altena 2022, 41), it is to be expected that geospatial analysis, 
when applied properly, can lead to new insights and a better 
understanding of the Bible, even though its application to the 


Bible is still in its infancy. 


4.3. Collaboration and Open Science 


Another field where progress can be made is the comparison of 
the various linguistic databases of the Hebrew Bible. Each data- 
base has its specific approach, and the user is most helped by 
being able to compare the various databases, their underlying 
assumptions, and the way in which these assumptions resulted in 
the annotations in each verse of the Bible.”’ It is a pity, however, 


that anyone who wants to compare the three most elaborate 


27 A nice comparison is made in Miller-Naudé and Naudé (2018). 
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databases available to date (cf. §81.1 and 2.3) needs to purchase 
them in commercial software packages. Only the ETCBC database 
and the Westminster Hebrew Syntax Without Morphology (cf. 
n. 6 above) are publicly available. Bringing the other databases 
into the open access domain is easier said than done, given copy- 
right issues and business models, as well as practical challenges, 
but hopefully these challenges can be resolved in the near future. 
This will be necessary to enable scholarly pursuits engaging with 
all the databases. 

Open Science, however, is more than making databases 
available. It relates also to the transparency of analytical proce- 
dures and the availability of queries and algorithms. A break- 
through in the application of computational linguistic analysis to 
the Hebrew Bible would be the availability of the workflows of 
the data creation processes and the programs that have been used 
in the creation of those databases and of the algorithms that are 
currently being developed for advanced cutting-edge approaches 
as those mentioned in §§2.4. and 3.3. 


4.4. Computational Linguistics and the New 


Testament 


This chapter focused on the use of computational linguistics in 
Old Testament studies. In New Testament studies we see parallel 
developments, although syntactic databases emerged somewhat 
later than in Old Testament studies.”® 


*8 For the pioneering work in the 1970s and 1980s see Mealand (1988). 
However, in the first decades of the emerging field of ‘Bible and 
Computer’ relatively more attention was paid to the Old Testament and 
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The morphological encoding started, as in the case of the 
Old Testament, in the 1970s, with, e.g., the GRAMCORD Greek 
New Testament (first published 1977); the work of Timothy and 
Barbara Friberg, who produced the Analytical Greek New Testa- 
ment (first published 1981); and MorphGNT, which was initiated 
in the 1980s by Robert Kraft at the University of Pennsylvania’s 
Center for Computer Analysis of Texts (CCAT) and received 
major updates and corrections by James Tauber from the 1990s 
onwards.”° 

The computational syntactic analysis of the New Testament 
received an impetus from two projects that started in the first 
years of the twenty-first century.*° The first project is the Greek 


Hebrew than to the New Testament and Greek. This is reflected, for 
example, in the contributions to the AIBI conferences (cf. above, §1.2). 
The contributions to the first AIBI conference (Leuven, 1985) included 
twelve contributions that dealt exclusively with Hebrew and the Old 
Testament and only four that dealt with Greek and the New Testament 
(besides eight other contributions). The second conference (Jerusalem, 
1988) showed similar statistics. It contained seventeen contributions on 
Hebrew and the Old Testament, two on the Septuagint, two on Greek 
and the New Testament and one on the Greek works of Gregorius of 
Nyssa (besides thirteen other on general issues or discussing both the 
Old and the New Testament). 

29 Available on GitHub: https://github.com/morphgnt, accessed 4 May 
2023. 

3° Because of copyright issues, these open-source projects are often 
based on the older editions by Nestle, Tischendorf and Westscott, and 
Hort, or on the SBL Greek New Testament, rather than on the most 
recent Nestle-Aland edition. For a morphologically annotated version of 
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New Testament of the OpenText.org initiative by Stanley E. Porter 
at McMaster Divinity College and partners. The goal of this pro- 
ject is, according to its website, “to construct a representative 
corpus of Hellenistic Greek (including the entire New Testament 
and selected Hellenistic writings of the same period) to facilitate 
linguistic and literary research of the New Testament docu- 
ments.” At clause level their annotations include four major 
categories: Subject, Predicator, Complement, and Adjunct. 

Some more syntactic categories (e.g., object, second object) 
are distinguished in a project of the Asia Bible Society, namely 
the Greek syntax trees produced by Andi Wu and Randall K. Tan 
(who was also involved in the OpenText.org project) and made 
available through Clear Bible (formerly Global Bible Initiative).*" 
These data interact well with other tools such as the Lowfat 


Syntax Tree Browser.” 


the Byzantine Text see https://github.com/byztxt, accessed 4 May 
2023. 


3! Greek syntax trees: https://github.com/biblicalhumanities/greek- 
new-testament/tree/master/syntax-trees, accessed 4 May 2023; Clear 
Bible: https://www.clear.bible, accessed 4 May 2023. 


2 See _https://github.com/biblicalhumanities/greek-new-testament/ 


tree/master/syntax-trees/reader/doc, accessed 4 May 2023. For a new- 
er release see https://github.com/Clear-Bible/macula-greek, accessed 4 
May 2023. 
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A project to bring the data from the OpenText project and 
those from the Asia Bible Society together in Text-Fabric is car- 
ried out by Oliver Glanz at the Center of Biblical Languages and 
Computing (CBLC) at Andrews University. 

Whereas computational Old Testament studies had a strong 
linguistic focus from the 1970s onwards, in New Testament stud- 
ies there were other areas in which the potential of the computer 
was explored first, such text editing, stemmatology, and manu- 
script studies. The computer program Collate, developed by Peter 
Robinson in the late 1980s (succeeded in 2010 by Collatex**) was 
soon adopted by New Testament scholarship in Birmingham and 
Miinster for text comparison and text editing. In the early 1990s, 
Gerd Mink, one of the editors of the Editio Critica Maior (ECM) of 
the Greek New Testament, developed the Coherence-Based Gene- 
alogical Method (CBGM; Wachtel 2019). This method was partic- 
ularly apt to deal with the typical features of the transmission of 
the New Testament, such as the high degree of contamination, 
which hinders the traditional genealogical tree-model (Gurry 
2016). 

New Testament scholarship has also made great progress in 
manuscript imaging (see various contributions in Hamidovié et 
al. 2019). Hundreds of manuscripts have been digitised and high- 
quality images can be studied and annotated in the Virtual Manu- 


script Room of the Institut fiir Neutestamentliche Textforschung 


33 See https://github.com/CenterBLC/NA, accessed 4 May 2023. 
34 See https://collatex.net/about, accessed 4 May 2023. 
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(INTF) in Miinster.*> And as in the case of Old Testament studies, 
new experiments with data meaning, text reuse detection, and 
the use of NLP for semantic information extraction have appeared 


on the scene (for examples see Hamidovic¢ et al. 2019).°° 


5.0. Further Reading 


The various databases discussed in this chapter do not provide 
final answers, but are useful tools, each of them situated in the 
complex field of linguistic theories. It is therefore extremely 
important to use them in consultation with the documentation 
listed below. 

Those who want to do more advanced analysis are advised 
to develop some basic programming skills and use the datasets 
that are available as a whole on GitHub or another platform, 
rather than only with a user-friendly search interface. 

In the case of the ETCBC database, for example, Meeuse 
(2021) is a good starting point for exploring the database through 
the user interface of the SHEBANQ website, but much more 
advanced research (as in the examples mentioned in §3.2) is 
possible for those who have mastered Python and use the Python 


package Text-Fabric to analyse the Hebrew Bible.*” 


35 See https://ntvmr.uni-muenster.de, accessed 4 May 2023. 

3° See also above, §4.2 on geospatial analysis in New Testament studies. 
3? SHEBANQ website: https://shebanq.ancient-data.org; ETCBC data- 
base on GitHub: https://github.com/ETCBC/bhsa; Text-Fabric: https:// 
github.com/annotation/text-fabric; Python courses: https://www. 
codecademy.com or https://www.udemy.com/user/fredbaptiste. All 
accessed 4 May 2023. 
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EMERGING FROM SILOS OF ANALYSIS: A 
COMPLEXITY THEORY APPROACH TO 
THE STUDY OF BIBLICAL TEXTS 


Sophia L. Pitcher 


Language is a complex system (Larsen-Freeman 1997; Larsen- 
Freeman and Cameron 2008; Ellis and Larsen-Freeman 2009a). 
Together, the many dimensions of language comprise an intricate 
nexus of heterogenous components that often interact in sur- 
prising ways. Some of the dimensions of language—grammar, 
orthography and the written transmission of texts, language 
acquisition, and language variation—comprise complex systems 
in themselves. For example, language acquisition, or more pre- 
cisely an individual’s developing proficiency in a language (see 
Larsen-Freeman and Cameron 2008, 157; Dornyei 2009; Ellis and 
Larsen-Freeman 2009b; Naudé 2012, 64), involves dynamic 
interactions between multiple agents. Speech communities, in 
constant flux, are comprised of individuals at various levels of 
proficiency who exhibit varying patterns of language use (see 
Mufwene 2008, 3; Beckner et al. 2009, 12-15; Naudé 2012, 67- 
68, 69). Language variation is similarly complex, as it encom- 
passes variation manifested synchronically and diachronically. 
Synchronic language variation happens in real time at the level 


of individual speakers (viz. idiolects) and communities of 
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speakers (viz. dialects; see Mufwene 2008, 15-28; Beckner et al. 
2009, 7, 9, 12-15; Blythe and Croft 2009, 48; Naudé 2012, 67- 
68), while diachronic language variation involves the interplay 
between culturally transmitted linguistic structures that “persist 
for millennia” (Naudé 2012, 73) and those that do not, resulting 
in the “renovation of morphosyntactic structure” (Givén 2009, 
43; Naudé 2012, 73) over time (see also Beckner and Bybee 
2009). 

Higher-level dimensions of language can be reduced to 
reveal complex components at lower levels of organisation. For 
example, the components of the grammar of a language include 
syntax, morphology, phonology, semantics, and discourse- 
pragmatics. These components interface at various levels and to 
varying degrees, forming dynamic and often nonlinear phenom- 
ena. For example, the phonological dimension not only includes 
a language’s sound system as represented by its consonants, 
vowels, and lexical tones (if present) but also prosodic phonol- 
ogy, with its own array of constituent features that interact with 
morphological, syntactic, semantic, and discourse-pragmatic 
domains. 

Conceptualised as a complex system, language consists of 
multi-dimensional layers that are distinct but nonetheless inter- 
related—each layer influencing the other, while simultaneously 
shaping the system as a continuously evolving whole. This hier- 
archical interconnectedness characterises the nature of complex 
systems. As Baicchi (2015, 10) describes, complex systems are 
“heterogeneous entities that, interacting with each other and 


with their environment, generate multiple layers of collective 
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structure exhibiting hierarchical self-organization without cen- 
tralized control.” Characterising language in such a way encour- 
ages analyses that are also multi-dimensional and that seek to 
account for linguistic phenomena as they exist within the system. 
Although a complex systems approach may threaten analysis 
with a bewildering degree of variability, it offers two clear and 
compelling benefits: (1) an awareness of the empirical nature of 
the object of study, which includes the presence of other phenom - 
ena that share its nexus, and (2) theoretical grounding that aims 
to provide greater analytical integration of seemingly disparate 
factors operating within the system. 

In other words, a complex-systems approach to language 
recognises that isolated analyses artificially simplify and often 
obscure the object of study (see Larsen-Freeman 1997; Larsen- 
Freeman and Cameron 2008, 9; Schwartzhaupt 2013, 262). Per- 
haps more fundamentally, it “offers greater coherence in explain - 
ing what [is] already know[n]” (Larsen-Freeman and Cameron 
2008, 11). It is for these reasons that a complexity approach to 
language is beginning to influence the way language phenomena 
are understood within certain subfields of linguistics. This sys- 
tems view is also beginning to impact the way biblical scholars 
analyse, teach, and translate biblical texts. 

To introduce the merits of a complexity approach to the 
study of biblical texts, this paper will discuss the origins of Com- 
plexity Theory (CT) and its historical development within the 
field of linguistics (81), the seven core attributes of complex sys- 
tems (§2), and the ways in which a complexity approach has been 
fruitfully applied to biblical scholarship (83). 
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1.0. Complexity Theory: Origins and Historical 


Development 


Baicchi (2015, 10) identifies the ancient Greek philosopher 
Heraclitus (ca 500 BCE) as articulating the earliest philosophical 
expressions on the dynamism and unity of natural phenomena 
(see also Larsen-Freeman and Cameron 2008, 6). Heraclitus 
described the material world as interconnected and in continuous 
flux. This perspective challenged a static philosophy of the mate- 
rial world, “which described reality as if it were composed of 
static individuals and the state of being as conceptually ‘simple’, 
internally unchangeable and undifferentiated” (Baicchi 2015, 
10). Baicchi (2015, 10) also cites the ‘process philosophy’ of 
Alfred Whitehead as advancing Heraclitus’ notion of dynamicity 
and originating ‘systems thinking’ in the early twentieth century. 
In his essay Process and Reality, Whitehead (1929) argues for a 
philosophical balance between “the metaphysics of substance... 
and the metaphysics of flux” (1929, 209). Whitehead does this 
by postulating that a “particular existent” or entity is character- 
ised by the notion of concrescence—a “fluency inherent in [its] 
constitution” (1929, 210). He argues that “the real internal 
constitution of a particular existent” (1929, 210) is an intercon- 
nected, dynamic process of a larger whole (1929, 211, 214-215): 
‘Concrescence’ is the name for the process in which the 
universe of many things acquires an individual unity in a 
determinate relegation of each item of the ‘many’ to its 
subordination in the constitution of the novel ‘one’. The 
most general term ‘thing’—or, equivalently, ‘entity’— 


means nothing else than to be one of the ‘many’ which find 
their niches in each instance of concrescence. Each 
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instance of concrescence is itself the novel individual 
‘thing’ in question.... The notion of ‘organism’ is combined 
with that of ‘process’ in a twofold manner. The community 
of actual things is an organism; but it is not a static organ- 
ism. It is an incompletion in process of production.... In 
this sense, an organism is a nexus. Secondly, each actual 
entity is itself only describable as an organic process. It 
repeats in microcosm what the universe is in macrocosm. 
It is a process proceeding from phase to phase, each phase 
being the real basis from which its successor proceeds 
towards the completion of the thing in question. Each 
actual entity bears in its constitution the ‘reasons’ why its 
conditions are what they are. These ‘reasons’ are the other 
actual entities objectified for it.... Thus each actual entity, 
although complete so far as concerns its microscopic pro- 
cess, is yet incomplete by reason of its objective inclusion 
of the macroscopic process. 


Whitehead understands an individual category or entity to be the 
perception of a process at a particular moment in time. Further- 
more, he concludes that an individual entity or occasion may be 
analysed, but the analysis is by nature transitory. 

While these philosophical antecedents of complexity are 
noteworthy, Larsen-Freeman and Cameron (2008, 2-3) credit 
mid-twentieth century scientists with formulating the most influ- 
ential precursors of CT. They cite Conrad Waddington’s descrip- 
tion of embryogenesis in 1940 as a pivotal systems description. 
Within this conceptual framework, an embryo is not fully deter- 
mined by genetic information, but rather (Larsen-Freeman and 
Cameron 2008, 2): 


each step in the process of development creates the condi- 
tions for the next one. In other words, “the form of the 
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body is literally constructed by the construction process 

itself—and is not specified in some pre-existing full instruc- 

tion set, design or building plan...” (van Geert 2003, 648- 

649). 
In a similar vein, in 1950, biologist Ludwig von Bertalanffy pro- 
posed a general systems theory that described “an entity as the 
sum of the properties of its parts” (Larsen-Freeman and Cameron 
2008, 2). In doing so, he eschewed reducing an entity to any one 
of its parts, underscoring the importance of the “relationships 
among the parts which connect them to the whole” (Larsen- 
Freeman and Cameron 2008, 3). 

Over the next three decades, scientists continued to develop 
a systems approach to natural phenomena (see Larsen-Freeman 
and Cameron 2008, 3—4).' In the 1980s, this field of research 
culminated in the founding of the Santa Fe Institute. The institute 
quickly became the hub for exploring the nature of complex 
adaptive systems, where CT was formalised and multidisciplinary 
research flourished (Larsen-Freeman and Cameron 2008, 3). The 
application of CT has been extended to such diverse fields (see 
Larsen-Freeman and Cameron 2008, 3-7; see also Pearce and 
Merletti 2006) as business management (Battram 1998), physics 
(Gell-Mann 1994), epidemiology (Pearce and Merletti 2006), 
psychology (Spivey 2007), economics (Arthur 2013), and trans- 


lation and development studies (Marais, 2014). 


' Marais (2014, 19) also cites chaos theory as an influential precursor 
to the development of CT. Chaos theory is a branch of mathematics that 
explores the underlying order of systems that exhibit random states of 
disorder and irregular patterns (e.g., the weather as demonstrated by 
the butterfly effect [Gleick 1987, 9-31; Dooley 2009]). 
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At the close of the century, Larsen-Freeman pioneered a 
complexity approach to applied linguistics, exploring the merits 
of a complexity model for language acquisition and second lan- 
guage instruction. Her primary hope for developing complexity 
thinking within her field was that “the dynamics of complex 
nonlinear systems [would] discourage reductionist explanations” 
(Larsen-Freeman 1997, 142). For Larsen-Freeman, much of the 
theoretical appeal of CT lies in the dynamic and holistic meta- 
phor that it engenders, enabling researchers to perceive funda- 
mental properties and interactions of language phenomena that 
remain opaque to traditional theoretical frameworks. She argues 
that the insights that emerge from a complexity approach are not 
equally accessible to these frameworks because at their core they 
reduce or isolate the object of study, and in doing so obscure it. 

According to Larsen-Freeman and Cameron (2008, 6), lin- 
guists have typically taken a binary approach to the description 
of language phenomena, as either an inner perspective of the 
mind (see Chomsky 1965), or an outer perspective of the world 
(see Weinreich et al. 1968). Linguists who take the inner per- 
spective typically investigate “mental competence rather than 
performance” (2008, 6), while those who have taken the outer 
perspective often seek to create “a model of language which 
accommodates the facts of variable usage” (2008, 6). Larsen- 
Freeman and Cameron (2008, 6) assess the deficiencies of these 
polar approaches to language theory in the following way: 

Applied linguists have... preferr[ed] to explain the facts of 

language... either through an appeal to a mental compe- 


tence... or by taking language use factors into account, 
showing patterns in variability.... But mental competence, 
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when it is seen to be “irreducibly self-contained, cannot 

meaningfully relate to the world outside” (Leather and van 

Dam 2003, 6)—which applied linguists must do—and the 

hybridity of more socially-oriented approaches have 

tended to treat the world (context) as an independent 
variable that influences linguistic form, not as a dynamic 
system itself. Here, perhaps, Complexity Theory may 
contribute to a resolution. 
A complexity approach to language resolves the tension of this 
dichotomy by rejecting the view that the environment is “exter- 
nal to and independent from the organism” (Larsen-Freeman and 
Cameron 2008, 7) and advancing the view that the dimensions 
of language “emerge from interrelated patterns of experience, 
social interaction, and cognitive mechanisms” (Beckner et al. 
2009, 2). 

Perhaps most fundamentally, Larsen-Freeman (2013, 370) 
understands CT to be a metatheory “still necessitating object 
theories” (viz. discipline theories). According to Larsen -Freeman, 
it is precisely this characteristic that endows CT with one of its 
greatest advantages—transdisciplinarity: CT “avoids the splin- 
tering of disciplinary knowledge and creates instead new forms 
of knowledge, which are thematic, cutting across disciplinary 
boundaries” (2013, 370). In fact, Marais (2014, 18) considers the 
trajectory of Western scholarship towards interdisciplinarity and 
complexity frameworks to be inevitable: 

In a sense, complexity thinking seems to be inevitable. The 

whole program of Western science has focused on ana- 

lyzing the parts of reality in order to understand them 


better.... Now, the realization is dawning on scholars that 
analysis can only take you so far, because only a small part 
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of reality is to be explained by the way parts are, or only a 
small part of reality can be understood by understanding 
the parts of it. Much of reality is to be explained not by the 
parts themselves but by the way in which they relate to 
one another or by the way in which they are becoming, the 
way in which constituent parts form wholes... The focus 
has thus shifted from an analysis of parts to a focus on the 
relationships and connections between parts and between 
parts and wholes. Also, the focus has shifted from an inter- 
est in phenomena to an interest in processes, that is, the 
way in which phenomena are the result of the interaction 
of their constituent parts. 


Marais is also quick to point out that while the notions of 
movement, process, and interdisciplinary synthesis factor into 
the descriptions of complex systems, they are not of primary 
importance. For Marais (2014, 18), stasis, being, and disciplinary 
analysis are equally important notions since they too characterise 
reality: 

The philosophical problems of stasis and movement, and 

of how both constitute reality, are within the purview of 

complexity thinking. Let me hurry to say that I do not sug- 

gest replacing analysis with synthesis or being with pro- 

cess. I hope to incorporate these binaries in a complexity 


view in which both sides of the binary find their rightful 
place in thinking about a particular phenomenon. 


2.0. Seven Core Characteristics of Complex 
Systems 
CT is chiefly concerned with the description and modelling of the 


relationships between the components of complex systems—par- 


ticularly, the relationships that emerge from changing and often 
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unexpected interactions among the components. According to 
Larsen-Freeman and Cameron (2008), seven core attributes char- 
acterise complex systems: 1) dynamism, 2) heterogeneity, 3) open- 
ness, 4) interconnectedness, 5) emergence, 6) nonlinearity, and 
7) adaptation. 

First, complex systems are dynamic. The dynamism of a 
complex system refers to its non-static nature; it is a system in 
constant flux (Larsen-Freeman and Cameron 2008, 29-30). In 
fact, Larsen-Freeman and Cameron assert that dynamism is likely 
the defining feature of language as a complex system (2008, 25) 
because “language, language use, and language development are 
continuously in action” (2008, 29). 

Second, unlike simple systems (e.g., a traffic light; see 
Larsen-Freeman and Cameron 2008, 27) that are comprised of “a 
small set of similar components... connected in predictable and 
unchanging ways” (2008, 27), complex systems embody a multi- 
plicity of dynamic, heterogeneous components. 

Third, complex systems are open. Unlike closed systems, 
those that are open allow energy, matter, or other influences to 
enter from the outside (Larsen-Freeman and Cameron 2008, 32). 
For example, a closed highway system would be one with a fixed 
number of vehicles and a fixed road capacity. However, in an 
open road system (2008, 31) where traffic continually increases 
but road capacity remains static, the roads will eventually “reach 
equilibrium in the form of gridlock” (2008, 32). Such a result 
initiates pressure to add road capacity, thereby influencing the 
context of the system. So, in an open highway system, free- 


flowing equilibrium can only be maintained by introducing 
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additional road capacity to the system. This means that open 
systems “not only adapt to their contexts but also initiate change 
in those contexts; these systems are not just dependent on context 
but also influence context” (Larsen-Freeman and Cameron 2008, 
34). 

A linguistic example of this attribute is provided in the 
lexicon—a subsystem of language that exhibits varying degrees 
of openness. Certain classes of words in the lexicon are more open 
than others; the addition or attrition of nouns, for example, is 
greater than that of prepositions. Furthermore, as an open sys- 
tem, the lexicon can undergo transformation generated by the 
interaction of lexical inputs and other dimensions of language. 
For example, the loss of case in Biblical Hebrew (see Waltke and 
O’Connor 1990, §8.1.c), a morphological change, initiated a pho- 
nological process that changed the vowel structure of segolate 
nouns. Specifically, the loss of the vocalic case marker resulted 
in a consonant cluster, triggering an epenthetic vowel, which 
then effected vowel harmony in the first syllable (e.g. *malk > 
malek > melek ‘king’; see Blau 2010, 54). 

This leads to the fourth attribute of complex systems— 
interconnectedness. Complex systems exhibit interconnectedness 
between their numerous components, dimensions, and contex- 
tual factors. A linguistic example of this attribute is illustrated in 
the morphophonemics of the Biblical Hebrew conjunction waw. 
As is well known, the coordinating waw morpheme has multiple 
allophones depending on its phonological context. However, 
Revell (2015, 43; 2016, 75-76) and Scheumann (2020, 58-59) 


observe that the proclitic allophones of waw are also determined 
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by their prosodic contexts—particularly, the position of waw 
within a phrase segmented by a disjunctive ta‘am. 

Conjunction waw takes a [games] before the second con- 

junct of each pair that has an initial accent [viz. an accent 

on the first syllable] [oi n'~> 1 Kgs 8.29].... Conversely, 

when a waw is attached to the first conjunct in each pair, 

it takes a schwa, even though these conjuncts also have an 

initial accent [n9°1 om Gen. 8.22].... The phonetics of the 

accented syllable do not play a decisive role for this vocal- 

isation of waw.... The vocalic change cannot be explained 

by phonetic assimilation or dissimilation rules. Rather, the 

distribution of the allophone 1 is determined by prosody 

(Scheumann 2020, 58-59). 

Moreover, scholars have proposed additional factors (see 
Scheumann 2020, 59-60) that may influence the phonological 
realisation of the conjunction waw with qames, including syn- 
tactic distribution (see Gesenius 1910; Revell 1981; Waltke and 
O’Connor 1990; Jotion and Muraoka 2006; van der Merwe et al. 
2017), semantics (Waltke and O’Connor 1990; van der Merwe et 
al. 2017; see also Revell 2015; 2016), and rhythm (Gesenius 
1910; Jotion and Muraoka 2006). 

The fifth attribute of complex systems is emergence. Com- 
plex systems exhibit emergent behaviour when dynamic interac- 
tions between lower-level components and dimensions of the 
system give rise to new phenomena that contribute to the nature 
of the system (Larsen-Freeman and Cameron 2008, 58-60; 
Marais 2019b, 47-49). For example, “subatomic particles give 
rise to atoms, atoms to molecules, molecules to organisms; or 


phonemes to words, words to propositions, propositions to 
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discourses” (Reid 2021, 33). In each case, the whole of the sys- 
tem—organism or discourse—is not simply the sum of its parts, 
but what emerges when the parts of the system interact. Ellis and 
Larsen-Freeman (2009a, 14-15) describe both idiolects and 
dialects as characteristically emergent: 

Language exists both in individuals (as idiolect) and in the 
community of users (as communal language). Language is 
emergent at these two distinctive but interdependent 
levels: An idiolect is emergent from an individual’s lan- 
guage use through social interactions with other indi- 
viduals in the communal language, whereas a communal 
language is emergent as the result of the interaction of the 
idiolects. 

The sixth attribute of complex systems is nonlinearity. 
Larsen-Freeman and Cameron (2008, 31) define this feature as 
“change that is not proportional to input” (2008, 31; see also 
Beckner et al. 2009, 16). An idiom is perhaps a simple example 
of nonlinearity because it is semantically non-compositional—the 
meaning of the whole expression (viz. output) cannot be derived 
from the meaning of its individual parts (viz. input; van den 
Heever 2013, II:106-110, 178).* For example, the input of wp. 
(‘seek’) and wa:-nx (‘the life’) yields an semantic output of ‘to 
want or try to kill someone’, not ‘to look for someone’s life’ (van 
den Heever 2013, 189-90). 


? Idioms (and their formation) constitute complex phenomena in them- 
selves (see van den Heever 2013). Van den Heever defines semantic 
non-compositionality as a complex concept “whose global meaning is a 
semantic extension of the combined meanings of its constituent 
elements” (2013, 178). 
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Another illustration of linguistic non-linearity is the process 
of grammaticalisation whereby a lexical item loses its semantic 
content and develops a grammatical function. For example, there 
is abundant cross-linguistic evidence for the diachronic grammat- 
icalisation of the copula into a focus marker (see Heine and Reh 
1984; Harris and Campbell 1995; Heine and Kuteva 2002; 
Lehmann 2015; Khan 2019). One of the pathways to grammat- 
icalisation of the copula (viz. input) begins with cleft sentences, 
and over time, in some languages, the copula ceases to serve as 
a verb (viz. output), and instead is interpreted as a particle 
marking focus.* For the languages where this has not occurred, 
Lehmann (2015, 125) states that the very existence of the 
“autonomous pattern” of the cleft sentence demonstrates that 
“the communicative function of focus is already minimally 
grammaticalized.” 

Finally, complex systems are adaptive. Larsen-Freeman and 
Cameron (2008) define adaptation as “the process in which a 


system adjusts itself in response to changes in its environment” 


>In many languages, the cleft sentence is the most explicit way of mark- 
ing focus (Lehmann 2015, 123-24). An example of a cleft construction 
in English is the sentence ‘It is X that Y’, where X is the focused NP and 
Y is the comment regarding the NP: It is JOHN that loves Mary (Khan 
2019, 14). 

* In some languages, cleft sentences exhibit evidence of further gram- 
maticalisation as the syntactic complexity of the construction simplifies 
through the loss of certain morphemes, such as the relativiser introduc- 
ing the subordinate clause: ‘It is X Y’ (Khan 2019, 15; see also Lehmann 
2015, 124). Khan (2019, 15) provides examples of this type of cleft 
sentence in Israeli Hebrew and Syriac. 
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(2008, 33). This attribute is evident in the written transmission 
of sacred texts, where editors adapt their translations to accom- 
modate a variety of editorial considerations (see Naudé and 
Miller-Naudé 2019, 201, 182-83; Reid 2021, 38-41). For exam- 
ple, Samaritan scribes chose to continue to use the paleo-Hebrew 
script instead of adopting the Aramaic square script. By doing so, 
they created an immediate visual distinctive for readers of the 
Samaritan Pentateuch (Reid 2021, 53). According to Reid (2021, 
53), the social schism between the Judean Jews and Samaritan 
Jews encouraged the use of a different script, which “transformed 
the visual impact of the manuscript and gave it a Samaritan 
identity” (2021, 53). 


3.0. Complexity Thinking in Biblical Scholarship 


This section provides an overview of complexity thinking as 
applied to five distinct areas of biblical scholarship. Naudé and 
Miller-Naudé have pioneered this approach, exploring complex- 
ity models for diachrony (§3.1), language pedagogy (§3.2), and 
the translation of sacred texts (83.3). This framework has been 
extended to include a complexity model for the Masoretic accents 
of the Tiberian Masoretic Reading Tradition (Pitcher 2020; §3.5), 
a complexity approach to syntactic coordination in Biblical 
Hebrew (Scheumann 2020; §3.4), and a paradigm for under- 
standing the Samaritan Pentateuch as an intralingual translation 
(Reid 2021; §3.3). 
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3.1. Diachrony 


Naudé (2012) models a complexity approach to the description 
of language change and diffusion in Biblical Hebrew. His stated 
aim for this approach (2012, 61) is to promote “viewing the 
diverse aspects of language change holistically (that is, under- 
standing the causal dependencies and emergent processes among 
the elements that constitute the whole system) rather than 
viewing them partially and in isolation.” Naudé enumerates four 
fundamental dimensions of language change and diffusion that 
must be accounted for in any substantive systems description of 
diachrony: (1) the dimension of the individual idiolect, (2) the 
social dimension within a community of speakers, (3) the dimen- 
sion of the process of chronological change over time, and (4) the 
development of written texts. He argues that an understanding of 
the dynamic interplay between each of these dimensions renders 
“the simple linear model of Ancient Biblical Hebrew, Early Bibli- 
cal Hebrew, Late Biblical Hebrew, and Mishnaic Hebrew” unten- 
able, yielding a more accurate and complex model of the lan- 
guage development (2012, 71). 

Within his system of diachrony, Naudé (2012, 72) describes 
the dimension of the individual as a domain where the formation 
of idiolects arises from the “innate grammar of an individual 
speaker.” He states that language change at this level reflects the 
differences between the grammar of the individual and “that of 
the input source (for example, child versus parents).” Change at 
this level is ongoing throughout the individual’s life and “always 


emergent” (2012, 72) as cognition develops and continues to 
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shape the individual’s competencies in the use of grammar (2012, 
70, 72; see also Givoén 2009). 

The social dimension of diachrony encompasses the changes 
that emerge within the dialects of communities of speakers. In 
this dimension, the diffusion of idiolects is at work, as “one dif- 
fering grammar becomes dominant and gains acceptance by the 
local speech community and later by society at large” (Naudé 
2012, 72). Naudé notes that diffusion always involves multiple 
idiolects that are at once competing with and being influenced 
by the standard language. 

The dimension of time recognises that language change and 
diffusion are processes that develop chronologically and often in 
observable patterns (2012, 72-73; see also Pintzuk 2003). 
According to Naudé (2012, 73), linguistic structures accumulate 
through the daily communicative interactions of speakers in a 
community, many of which are culturally transmitted to future 
generations in the form of “fossilized linguistic structures.” The 
complex process by which some linguistic structures (e.g., mor- 
phological, phonological, syntactic) “persist for millennia,” while 
other structures simplify or completely erode, is called the “dia- 
chronic cycle.” It is through this cycle that language undergoes 
“deep structured changes.” 

The fourth dimension of diachrony is the development of 
written texts. In order to account for this dimension, Naudé 
(2012, 73) asserts that the differences between speech and writ- 
ing must be identified. One fundamental difference is that 
“writing is secondary to speech and employs special forms... for 


its unique purposes (for example, the use of devices for the 


290 Pitcher 


organization of discourse).” Understanding these differences 
includes accounting for the use of the text and the context out of 
which it arises. 

For example, for Biblical Hebrew, it is important to under- 
stand the context of the Ancient Near East, where the written text 
was often used as an administrative tool and did not necessarily 
reflect “a connection with spoken language” (Naudé 2012, 73); 
this was an “advantage for [outstretched empires] and linguisti- 
cally unrelated agents.” Naudé (2012, 74-75) argues that this 
contextual description must also consider the socio-cultural 
nature of the epigraphic and biblical texts, including scribal 
culture; the political and economic circumstances at the time of 
composition or redaction; the types of discourses produced (viz. 
royal, history, law, prophecy) and the audiences they address 
(viz. the monarchy, the people, personal second person versus 
impersonal third person). Furthermore, biblical texts have an 
added layer of complexity because “they were transmitted 
through multiple editors and copyists, rather than archaeologi- 
cally excavated straight from their original context” (2012, 74). 
Naudé maintains that all these factors must be considered when 
explaining the process of diachrony in biblical languages. 

Naudé (2012, 75) illustrates the complexity of the dia- 
chronic cycle that “began in Biblical Hebrew, escalated in 
Qumran Hebrew, and was finished in Mishnaic Hebrew” using 
data “on the waw-consecutive and distribution of the independ- 
ent personal pronouns in Biblical and Qumran Hebrew.” He 
traces the chronological use of consecutive waw with perfective 


verb forms and notes that this form is statistically more prevalent 
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in Biblical Hebrew than Qumran Hebrew, and almost completely 
absent in Mishnaic Hebrew (2012, 76-77). While changes in 
these verbal forms throughout the various stages of Hebrew have 
been well-documented (2012, 77, n. 2), the theory of diachrony 
that Naudé (2012, 77-78) develops explains their significance: 
In the biblical period, the change occurred in one or more 
idiolects. The change then diffused so that, by the time of 
Qumran, there were fewer examples of the consecutive 
form and more examples of conjunctive waw. The exam- 
ples of consecutive forms that remained were stylistic fos- 
sils. By the time of the Mishnah, the cycle of change was 
complete: the consecutive forms occurred only in biblical 
quotations, and the vernacular language was structurally 
different from the language in the Bible. 
The system of diachrony advanced in Naudé (2012) moves tra- 
ditional descriptions of language development beyond notions 
that characterise the various forms and stages of biblical lan- 
guages according to the “synchronic styles available to biblical 
authors” (2012, 71). 


3.2. Language Pedagogy 


Miller-Naudé and Naudé (2014) present a complex systems 
approach to teaching and learning Biblical Hebrew by reading the 
biblical text—an approach that systematises some of the best 
teaching practices for Biblical Hebrew instruction. The method- 
ology they employ is based on pedagogical complexity models, 
which assume a theory of second language development that is 


multi-dimensional and involves “a process of dynamic adaptation” 
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(2014, 95; see Larsen-Freeman and Cameron 2008; Ellis and 
Larsen-Freeman 2009a).° 

Miller-Naudé and Naudé (2014, 92) select the act of read- 
ing as the basis of an effective complexity model for teaching 
biblical languages because the act of reading requires a range of 
complex skills ranging from “lower-level visual processing 
involved in decoding the print to higher-level skills involving 
syntax, semantics, and discourse, and even to skills of text rep- 
resentation and integration of ideas with the reader’s global 
knowledge” (Nassaji 2009, 173). Furthermore, they argue that 
the socio-political context of their classroom at a South African 
university itself provides grounds for developing such an 
approach. Their description of “the ‘new’ South Africa (since 
1994)” (2014, 92) celebrates a country with twelve official lan- 
guages and the integration of peoples who were once rigidly sep- 
arated from one another. Accordingly, one key dimension of their 
pedagogical approach has been to hire a multi-cultural and multi- 
lingual teaching staff that is able not only to teach in multiple 
languages (viz. Afrikaans, English, Sesotho), but also to provide 
parallel linguistic examples and explanations of Hebrew linguis- 
tic structures in these diverse languages. 

Another key contextual dimension that Miller-Naudé and 
Naudé’s (2014) pedagogical model accounts for is the manner 


in which South Africa’s national teaching requirements train 


° The model of language development proposed here contrasts with 
models of language acquisition which describe learners accessing a 
fixed “mental architecture of language” (Miller-Naudé and Naudé 2014, 
95). 
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students to learn new information. They observe that these 
modes of learning tend to conflict with traditional methods for 
teaching Biblical Hebrew. Particularly, they have found that 
South Africa’s outcomes-based educational system is focused “too 
much on the application of knowledge” (2014, 93): 

As a result, students/learners typically have not acquired 

the skills for memorization. Traditionally, the teaching of 

Biblical Hebrew has relied upon memorization for the 

acquisition of vocabulary and verbal paradigms, but this 

method is almost impossible for students whose previous 
academic experience has exclusively relied on Outcomes 

Based Learning (2014, 93). 

The six dimensions of Miller-Naudé and Naudé’s (2014) com- 
plexity model for teaching Biblical Hebrew are as follows: (1) the 
use of a typological framework for introducing grammatical fea- 
tures, (2) the use of the five complex cognitive skills that must be 
developed when learning to read, (3) building students’ vocabu- 
lary, (4) reading, singing, listening, and simplified speaking, (5) 
increasing students’ understanding of the cultural context of the 
biblical text, and (6) the application of exegesis and theology to 
the biblical text. 

The first dimension of the model involves establishing a 
typological framework for teaching the grammatical features of 
the biblical language. A typological framework is particularly 
suited for this because it classifies languages that are “genetically 
unrelated and that have no geographical proximity” (2014, 96) 
according to “shared formal characteristics” (Whaley 1997, 7). 
Miller-Naudé and Naudé argue that language typology is a key 


pedagogical feature because it allows instructors “to describe the 
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grammatical features of [the biblical language] to speakers with 
different mother tongues” (2014, 96). For example, they observe 
that English- and Afrikaans-speaking students often find the 
Hebrew construct phrase more difficult to grasp because in these 
languages “the genitive precedes the noun... a predominant 
pattern among the world’s languages” (2014, 96). However, 
speakers of African languages such as Sesotho quickly grasp this 
concept because the noun-genitive pattern in these languages 
corresponds to the pattern found in Hebrew. Miller-Naudé and 
Naudé (2014, 96-97) state that these types of conceptual difficul- 
ties with the features of a biblical language can be mitigated by 
explaining them in terms of the typological differences found 
among the languages of the world. 

The second dimension of the model is quite complex in 
itself because it incorporates the five following cognitive skills of 
reading (2014, 97-98): (1) learning to interpret and pronounce 
the orthography; (2) developing the ability to visually process 
words (viz. learning to identify a word by recognising the string 
of consonants); (3) developing reading comprehension skills by 
learning to “[select] the meaning of the word that is relevant 
to the context” (2014, 98); (4) learning to identify discourse- 


level features;° and (5) learning to interpret “culturally relevant 


© According to Miller-Naudé and Naudé (2014, 98), discourse-level 
features include “compound sentences, identification of the narrative 
structures of on-line and off-line information, volitive chains, the 
embedding of direct and indirect speech, and genre-specific features 
such as poetic lineation and poetic word pairs.” 
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information for processing the pragmatic inferences of biblical 
texts” (2014, 97). 

The third dimension of the model encompasses the teach- 
ing of vocabulary. For Miller-Naudé and Naudé (2014, 100), 
vocabulary instruction includes equipping students with not only 
sound files for hearing and pronouncing words but also the cul- 
turally relevant information needed to understand them in their 
textual contexts. 

The fourth dimension of the model involves several key 
activities—reading, singing, listening, and simplified speaking— 
that all reinforce new skills and allow students to integrate and 
practise the concepts that they are being taught. For example, 
reading activities include both an intensive and an extensive 
component. Intensive reading “involves close, deliberate study of 
short texts with attention to vocabulary, grammar and discourse 
of the text” (2014, 102), while extensive reading “involves a wide 
range of texts of different genres, text structures and language 
patterns.” Additionally, students are provided with audio files of 
their classroom materials, which affords them ample opportunity 
to listen to the biblical language. Finally, singing biblical verses 
and performing simplified oral drills and dialogues reinforces 
“specific morphological forms, syntactic constructions and 
vocabulary items” (2014, 101) in engaging ways. 

The fifth dimension of the model provides students with the 
necessary cultural information regarding the ancient biblical 
world so that they can “bridge the gap between their cultural 
context... and that of ancient Israel” (2014, 100). 
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Finally, the sixth dimension of the model involves the 
application of exegesis and theology to the biblical text. For 
Miller-Naudé and Naudé (2014, 100), this is the “ultimate goal 
of Biblical Hebrew language teaching” and the reward for stu- 
dents within the religious context of their institution. This dimen- 
sion enables students to apply and appreciate their growing 


knowledge of the language and culture of the biblical text. 


3.3. Translation Theory 


Marais (2019b, 43; see also 2014) describes translation as a com- 
plex process that emerges out of cause-and-effect relationships 
among a variety of dimensions, including “linguistics, pragmat- 
ics, culture, society, ideology, power, a brain, a human personal - 
ity and meaning” (2019b, 46-47). Marais (2019b, 44) argues that 
the process of translation—that is, “the process of making mean- 
ing”—is fundamentally a sign-process (viz. semiosis), entailing 
the interrelatedness of three elements: (1) the sign, (2) a mental 
representation of the referent triggered by the sign, and (3) the 
actual referent, “either an idea of something or the thing itself.” 
Conceptualising translation as a process means that the mental 
construct is only relevant for “a particular moment in the semio- 
sis”’—that is, the mental construct is “not absolutely final, only 
pragmatically final” (2019b, 44). 

Furthermore, Marais (2019b, 43, 46) argues that the incipi- 
ent text is as much an emergent phenomenon as the subsequent 
text, and that the process of translating is not linear or binary— 
there is no direct line from incipient text to subsequent text. For 


Marais (2014; 2019b, 46), a complexity model for translation not 


Complexity Theory 297 


only means that the translation process is recursive but also that 
the ‘turns’ of translation are not mutually exclusive. As such, 
previous translations of a text are not invalidated by subsequent 
ones, but instead are viewed as “complementary perspectives 
that contribute to a fuller understanding of the complexity of 
translation” (2019b, 46; see also Robinson 2017). 

Naudé and Miller-Naudé (2019) explore Marais’ complexi- 
ty model using the Book of Ben Sira in the Septuagint as a case 
study. Their complexity framework for Ben Sira serves as an 
alternative to the frameworks that are utilised in two recent 
modern translations—namely, the Septuaginta Deutsch project 
(LXX.D) and the New English Translation of the Septuagint project 
(NETS).’ Naudé and Miller-Naudé (2019, 181) consider these 
translations to be reductionist approaches because the former 
domesticates the text, while the latter renders it foreign to the 
reader. For example, NETS employs a literal translation of the 
Greek and incorporates transliterated terms. They observe the 
opposite, but equally reductionist, phenomenon with LXX.D as it 
veers too heavily in the direction of favouring German culture at 
the expense of the culture of the incipient text (2019, 181). 
Marais (2014, 40) describes the pull of these types of reductionist 
translation models as either being “too strongly biased toward 
the direction of the target [text],” as in the case of LXX.D, or “too 
strongly biased toward the source [text],” as in the case of NETS. 

Naudé and Miller-Naudé argue that their complexity 
approach to Ben Sira resolves this stark dichotomy by both 


7 See Ross (2021) for an overview of the range of recent Septuagint 
translation projects. 
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“respecting its alterity” (2019, 203) and being sensitive to “issues 
of intelligibility and representation.” They accomplish this by 
identifying two sets of dimensions that are inherent in sacred 
writings and incorporating them into the production of the trans- 
lated text, largely in the form of footnotes. 

The first set of dimensions characterise the religious nature 
of sacred texts. These dimensions explore the psychological and 
sociological aspects of religion, as well as the influence that the 
oral-writing traditions of the text have wielded over time. The 
psychological dimension “influences how individuals perceive 
and react to the environment in which they live,” while the socio- 
logical dimension encompasses a complex nexus of “intercultural 
and interlinguistic communication influenced by socio-cultural, 
organizational, and situational factors that result in self-critical 
corrections, adaptations, and apologies in religious discourse and 
practice” (2019, 182). Naudé and Miller-Naudé (2019, 181-182) 
observe: 

Sacred writings, which are texts beyond everyday life that 

inspire awe, respect, and even fear, are associated with 

religion and have various special functions or roles within 

a religious context (Sawyer 1999). As a complex phenome- 

non, religion and its sacred writings form an inextricable 

part of culture. Religion is a central part of human experi- 

ence, influencing how individuals perceive and react to the 

environments in which they live (Giddens 1993, 456). 

Naudé and Miller-Naudé (2019, 182) describe the oral-writing 
tradition of sacred texts as the words, texts, and language used in 
public worship, “which are the result of complex processes of 


canonization and translation.” They also recognise that most 
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religious communities require translated versions of their sacred 
texts, and that these texts “quickly assume the status of incipient 
(source) texts and become central to the religious domain.” 

Sacred writings, then, exhibit complex webs of interaction 

between numerous emergent, complex adaptive systems— 

the religious communities who produce and use sacred 

writings, the sacred writings as emergent incipient 

(source) texts, the sacred writings as emergent subse quent 

(target) texts, and in some instances, subsequent (target) 

texts as emergent sacred writings (Naudé and Miller-Naudé 

2019, 183). 
The second set of contextual dimensions for the complexity 
model accounts for the complex nature of the Septuagint and the 
place of Ben Sira within it. According to Naudé and Miller-Naudé 
(2019, 183-84), the Septuagint is itself a complex adaptive 
system for the following historical, text-critical, editorial, and 
socio-cultural reasons: (1) it has a complex history of origin and 
transmission; (2) the source text(s) for the translated texts of the 
Septuagint are pluriform and emergent “in that it did not reach 
its final, canonical form until many centuries later” (see also 
Ulrich 2015); (3) the Septuagint comprises “multiple translations 
(e.g., into Latin and Syriac) and revisions (e.g., by Aquila and 
Symmachus)” that were produced to serve the “needs and 
concerns of various religious communities, especially since the 
Protestant Reformation;” (4) although originally produced for 
Jews, they subsequently rejected it after it was adopted by 


Christians as their sacred text.* Furthermore, according to Naudé 


8 For an alternative view regarding this claim, see De Lange et al. 2009. 
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and Miller-Naudé, while “research on the Septuagint has been 
driven primarily by the needs of textual criticism and modern 
philology, either as a search for the Vorlage or Urtext (earliest) 
or as a search for the best or most authoritative final text,” anew 
philology approach to the Septuagint views each individual 
manuscript “as a meaningful, historical artifact, and variants 
found in these manuscripts are viewed... interesting in their own 
right.” 

Naudé and Miller-Naudé argue that a complexity model 
for Ben Sira, then, considers and incorporates these multiple 
dimensions. One of their strategies for avoiding foreignising or 
domesticating the text is by making use of footnotes in order to 
both provide the modern reader with access to the alterity (viz. 
foreignness) of the Hebrew incipient text and make this alterity 
comprehensible. 

For example, Ben Sira 24.13-17 depicts Lady Wisdom in 
terms of “a variety of flora that are mentioned in their ecological 
contexts” (2019, 189). According to Naudé and Miller-Naudé 
(2019, 190), since this portion of text lacks a Hebrew incipient 
text, these terms must be translated with respect to “both the cul- 
tural world of ancient Israel and the appropriation of those cul- 


tural terms in metaphorical contexts.” In this poetic passage, they 


° Naudé and Miller-Naudé (2019, 184) state that “new philology as a 
philological perspective within the larger field of editorial theory pro- 
vides a model broadly conceived for understanding texts, text produc- 
tion, and transmission—and for exploring texts in their manuscript 
contexts.” Note that Brill’s Septuagint Commentary Series assumes this 
approach (see Porter 2021 for a description). 
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provide three examples where the NETS translation chooses an 
English term whose referent either is not found in Israel or does 
not fit the metaphorical function of the plant in the passage, or 
both (2019, 190-93). Furthermore, in instances where additional 
information is needed to fill out a modern reader’s understanding 
of the term, footnotes “open up the cultural world of the text to 
the modern reader by indicating both the botanical characteris- 
tics and the social functions of the plant. The alterity of the text 
is retained and respected but without domestication of its for- 
eignness” (2019, 193). For Naudé and Miller-Naudé (2019, 193), 
footnotes are powerful metatexts that help retain the complexity 
of text by making the alterity of the incipient text accessible and 
intelligible to a modern reader. 

Following Marais (2014; 2019a; 2019b) and Naudé and 
Miller-Naudé’s (2019) models for translation, Reid (2021) demon- 
strates a similar use of complexity principles in his analysis of the 
Samaritan Pentateuch as an intralingual translation—that is, a 
rewording or reinterpretation of a text in the same lan guage. Reid 
presents a paradigm for understanding the Samaritan Penta- 
teuch, particularly, the variants found in the text, “the choices 
made by the editors... the context they worked in, and the reason 
they embarked on the project” (2021, 2). Unlike interlingual 
translations that “might be done simply to facilitate communica- 
tion between speakers of different languages” (2021, 50), Reid 
argues that “intralingual translations must be done for a particu- 


lar reason or skopos, as the original material is already accessible 
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in the target language.”!° Reid concludes that the skopos for the 
Samaritan Pentateuch is rooted in a particular set of scribal 
interventions that emerge from a complex interaction of multiple 
texts—e.g., proto-Masoretic, pre-Samaritan, Septuagint (2021, 
10-22, 71)—and multi-dimensional factors, including (1) the 
diachronic development of the Samaritan dialect and inter- 
dialectic variants (2021, 59-60); (2) editorial considerations, 
e.g., choosing to preserve the paleo-Hebrew script (2021, 53) and 
substituting archaic grammatical forms for those that were more 
commonly used (2021, 58); and (3) the “cultural rich points” that 
differentiate the Jewish and Samaritan traditions, e.g., the 
correct place of worship, Gerizim versus Jerusalem, (2021, 60- 
64). 


3.4. Syntax 


Scheumann (2020) applies complexity thinking to the grammat- 
ical feature of phrasal coordination in Biblical Hebrew because 
he recognises that “semantic observations of clausal coordina- 
tion” do not yield a comprehensive description of coordination, 
and that a purely syntactic analysis is equally insufficient (2020, 
11). His complexity approach provides a metatheory for the 
Minimalist Programme (see Chomsky 1995) that undergirds his 
syntactic analysis, allowing for other relevant dimensions of 
language to be considered (2020, 6, 10): 


10 In accordance with Vermeer (2004), Reid defines skopos as “the aim 
or purpose of a translation” (Vermeer 2004, 227; see also Reiss and 
Vermeer 2014; Nord 2018). 
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I approach [Biblical Hebrew] language as a connected 

whole with the interacting levels of syntax, semantics, 

prosody, morphophonology, and discourse.... Moreover, 
because coordination is so pervasive in the Pentateuch, it 
requires a grasp of many areas of [Biblical Hebrew] syntax 

that intersect with phrasal coordination: prosody, pronoun 

binding, information structure, word order, null elements, 

ellipsis, differential object marking, prepositions, nega- 

tion, markedness, apposition, subordination, and verbal 

agreement. 

Using cross-linguistic data to support his claim for the syntactic 
structure of coordination in Biblical Hebrew, Scheumann (2020, 
30-32, 56) proposes an asymmetric, hierarchical structure based 
on the following complex features: (1) asymmetric prosodic 
breaks exhibiting the segmentation between conjuncts at the 
phrasal and clausal levels (2020, 30); (2) the cliticisation of the 
coordinating particle to one of the conjuncts (2020, 30-31); (3) 
syntactic constraints on the locus of a coordinator within a clause 
(2020, 31); (4) constraints on which conjuncts are able to do the 
joining or binding within a clause (2020, 32); and (5) asymme- 
tries between verbal agreement and the structure of coordination 
(2020, 224-25). 

Scheumann observes that his hierarchical model for coor- 
dination has implications for understanding instances of partial 
verbal agreement and conjoined subjects (e.g., 2 Sam. 2.12). He 
concludes that partial agreement is the default pattern when the 
coordinate complex is post-verbal (e.g., Gen. 33.7), and that “the 
verb always agrees with the initial conjunct (e.g., Exod. 21.4), 
whether the coordinate complex is post-verbal or pre-verbal” 
(2020, 227). 
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While Scheumann determines that syntax is the primary 
factor for understanding coordination and verbal agreement 
asymmetries, he recognises that syntactic analysis alone is insuf- 
ficient to explain the different types of asymmetries that are 
found in Biblical Hebrew (2020, 238, 261). Having established 
the underlying syntactic structure for coordination, he deter- 
mines (2020, 246-49, 261) that the role discourse plays in the 
partial agreement of verbs with coordinated subjects is more 
accurately understood when considered together with other fac- 
tors such as conjunct word order, conjunct binding (viz. a coref- 
erential pronoun that binds to the second conjunct [2020, 84]), 


and the subsequent use of pro-drop (2020, 262): 


We have seen that in sentences with a post-verbal coordi- 
nate structure headed by a pronoun, the number of the 
verb is noteworthy. The compound is not the subject, but 
functions as a [quantifier phrase] that modifies the subject. 
A singular verb, thus, has discourse significance, because 
the subject is singular, which is coreferential with the 
pronoun conjunct. In this way, the pronoun referent is the 
principal actor because, syntactically, it is the only actor 
in terms of subjecthood. The prominence of the pronoun is 
highlighted further with conjunct binding and with a 
subsequent singular pro-drop verb. Conversely, a plural 
verb does not mark the conjuncts as equally-prominent 
actors. While both conjuncts in the [quantifier phrase] 
explicate the plural pro subject reference, the initial 
conjunct in the [quantifier phrase] can still be discourse- 
prominent, much like how the first conjunct in a com- 
pound subject can be the principal actor, whether the verb 
is singular or plural. 
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3.5. Masoretic Accents of the Tiberian Masoretic 


Reading Tradition 


The current accepted analytical model for the Masoretic accents 
of the Tiberian Masoretic Reading Tradition is rooted in a 
nineteenth-century philological algorithm known as the Law of 
Continuous Dichotomy (LCD, Wickes 1887), which reduces (i.e., 
artificially simplifies) the accents to a system of pausal segmen- 
tation. However, Pitcher’s (2020; 2023; see also 2021) model for 
the accentual system departs from the LCD as a tool of analysis 
in order to advance a complexity approach that integrates the 
system’s features, structures, and functions within a wholly 
linguistic framework. Pitcher argues that LCD-based analyses 
cannot appropriately account for or accommodate the complex 
linguistic data represented by the accentual system, and should, 
therefore, be replaced by a cross-linguistic complexity model for 
six main reasons: (1) LCD-based algorithms were designed to 
reconstruct the ‘correct’ sequences of accents in a verse (Wickes 
1881, Preface, 5-7; see also Pitcher 2023), not to identify and 
explain the complex linguistic data that they represent; (2) schol- 
ars have long acknowledged the prosodic nature of the system 
(Spanier 1927; Lazarus 1942; Avenary 1963; Janis 1987; Dresher 
1994; Strauss 2009; DeCaen and Dresher 2020; Park 2020), but 
the LCD does not treat the accents as natural speech phenomena; 
(3) the LCD is a philological algorithm with no theoretical basis 
outside of itself; (4) the LCD reduces the accentual system to 
pausal segmentation; (5) a cross-linguistic complexity model 
provides a more accurate description of the accentual system, 


integrating its seemingly disparate features (viz. melody, stress, 
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prosodic phrase structure, syntax interface, and meaning); and 
(6) a cross-linguistic complexity model provides the necessary 
theoretical framework to test and substantiate the complex dimen- 
sions of the accentual system’s features, structures, and functions. 
At its core, a cross-linguistic complexity model for the accentual 
system recognises that the multifaceted dimensions of lan- 
guage—including tonal and metrical structure, syntax, seman- 
tics, and discourse-pragmatics—comprise a prosodic system, and 
this model seeks to account for these dimensions within a com- 
prehensive linguistic analysis. 

The basis for this new analysis is partly grounded in the 
oral orientation of traditional descriptions of the accents and the 
notion of recitation found therein (see Spanier 1927, 9-17; 
Crowther 2015, 54-63; Eldar 2018, 53-56; Shoshany 2022, 23- 
24, 27-28; see also Khan 2013, 37, 43, 47; Khan 2020a, 1-3, 49- 
53), as well as the system’s close association with the vocalic 
phonology of the orally performed text (Khan 2013, 37-65; Eldar 
2018, 85-88; Khan 2020a, 96-97, 2020b, 1-3; Posegay 2021, 
25-26, 38-42, 82-84, 131; see also Crowther 2015, 50-51, 62- 
63). Furthermore, according to Yeivin (1980, 158), the accents 
are traditionally understood to perform three functions: (1) to 
represent the melodies that accompany the reading of the text, 
(2) to mark the locus of lexical stress, and (3) to mark semantic 
units (1980, 158)—that is, to contribute to the meaning of the 
text. A cross-linguistic complexity model unifies these functions 
within the theoretical framework of prosodic phonology. Within 
this model, the accentual graphemes constitute a prosodic 


orthography, where conjunctive and disjunctive graphemes are 
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iconic representations of pitch accents at three levels of contras- 
tive pitch—low, high, and extra high (Pitcher 2023; see also 
Wickes 1887, 13-14; Lazarus 1942, 283-86; Pitcher 2020, 145- 
58, 189-92;)—embedded within two domains of prosodic phrase 
structure. In other words, the orthography of the accents repre- 
sents pitch fluctuations in the flow of speech that are associated 
with the locus of lexical stress, with disjunctive accents addition- 
ally indicating the locus of two types of cross-linguistically dis- 
tinct phrasal boundaries—namely, those of the intermediate 
phrase (@) and the intonational phrase (1). 

This cross-linguistic complexity model, however, not only 
identifies the precise nature of the accentual system but also pro- 
vides a way to assess its functional semantic features. For exam- 
ple, relative clauses, which have attested cross-linguistic prosodic 
features (Nespor and Vogel 2007, 188), provide an accessible 
syntactic domain to test how the prosodic system represented by 
the accents contributes to the meaning of the biblical text. 
Semantically non-restrictive relative clauses, which provide sup- 
plemental information for an already identifiable referent, form 
intonational phrases separate from their head nouns. Semanti- 
cally restrictive relative clauses, which modify the head noun by 
restricting the identification of the referent, form cohesive into- 


national phrases with their head nouns.” 


' The prosodic data of relative clauses within the 21 Books of the 
Hebrew Bible also include prosodic formats that are prosodically ambig- 
uous with regard to restriction (Pitcher 2020, 333-55). Note that this 
type of prosodic ambiguity has been attested in Birkner’s (2012) study 
on the prosodic realisations of relative clauses in German. Birkner’s 
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These cross-linguistic prosodic features, which distin- 
guish the semantic restriction of relative clauses, are present in 
the prosodic record of the Masoretic Text. Example (1) illustrates 
a Tiberian Hebrew relative clause that is semantically restrictive, 
where the head noun (i3a) shares a cohesive intonational phrase 
(\) with its relative clause (139 nT'7Wy): 

(1) pa) PIN?) O37 72M) 
prepay) POI TT WY) PRR OY OFAN Xp") 
((and.she.bore Hagar)p (to.Abram)p (son)@)t 
((and.he.called Abram name.of-his.son)@ (whom-she.bore 
Hagar)@ (Ishmael)@)t 
‘And Hagar bore to Abram a son. And Abram called the 
name of his son whom Hagar bore, Ishmael.’ (Gen. 16.15) 
Example (2) illustrates a Tiberian Hebrew relative clause 
that is semantically non-restrictive, where the proper head noun 
(n177) and relative clause (FX’xin Ws) constitute separate intona- 
tional phrases (1). 
(2) (pT? 79WT)) 
(pin nsy) p(nawn7a)) 
((be.guarded for.yourself)@)t 
((lest-you.forget)@ (ACc-LORD)@)t 
((who he.brought.you)@ (from.land.of Egypt)@ (from. 
house.of slaves)@)t 


study demonstrates that these prosodic realisations are “more hetero- 
geneous than... presented in grammar books, complying neither with 
dichotomous semantics nor with the two postulated formats” (2012, 
20). 
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‘Guard yourselves, lest you forget the LORD, who brought 
you out of the land of Egypt, from the house of slavery.’ 
(Deut. 6.12) 

Another example of the semantic feature of the Masoretic 
accentual system involves the disambiguation of tripartite verb- 
less clauses from verbless clauses with left dislocation. Using a 
cross-linguistic framework, Naudé and Miller-Naudé (2017, 233) 
note that syntactic left dislocation “involves a gap at the bound- 
ary between the dislocated constituent and the matrix sentence,” 
which can be realised by a pause, interjection (Berman-Aronson 
and Grosu 1976), or intonational contour (Korchin 2015, 14-15). 
Naudé and Miller-Naudé use this gap feature to disambiguate the 
subject NP of a tripartite verbless clause from a left-dislocated NP 
construction. They observe that a left-dislocated NP in the bibli- 
cal text is phonologically characterised by a disjunctive accent 
that separates it from the matrix clause, while the pronoun of a 
tripartite clause (PRON) is phonologically conjoined to the sub- 
ject NP. The prosodic formats of these two types of constructions 
confirm this analysis. 

Example (3) exhibits the prosodic format of a left- 
dislocated verbless construction, where a prosodic phrase bound- 
ary separates the 3ms subject pronoun (X17) from the topicalised 
NP (on)). 

(3) u(p(om)) 
UpGwia’as) psi) 

((and.Ham)@)t 

(the) (father.of Canaan)@)t 

‘As for Ham, he was the father of Canaan.’ (Gen. 9:18) 
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Example (4) exhibits the prosodic format of a tripartite 
construction, where the 3ms pronoun (x1, i.e., PRON) shares the 
same internal intermediate phrase with the subject NP (owx7).” 
(4) (p29) (sin DWNT) E(NRXYNS) MC) 

((for)p (as.the.sin-offering)p (the.guilt-offering PRON)» 

(to.the.priest)@)t 

‘For just like the sin-offering, the guilt-offering belongs to 

the priest.’ (Lev. 14:13) 

These examples demonstrate that a cross-linguistic com- 
plexity approach to the Masoretic accents can more precisely 
identify and unify what is already known about the system, while 
also providing scientific avenues for exploring its semantic and 


discourse-pragmatic functions. 


4.0. Conclusion 


This chapter introduced Complexity Theory, a metatheory whose 
principles provide a framework for understanding language as a 
complex system, in order to introduce the merits and budding 
developments of complexity thinking in the study of biblical 
texts. The core attributes of a complex system were presented 
along with an overview of the ways in which complexity thinking 
has been applied thus far to five areas of biblical scholarship: 
diachrony, language pedagogy, translation theory, syntax, and 
the Masoretic accents of the Hebrew Bible. These studies have 


demonstrated the fruitful outcomes of such an approach and 


’2 Naudé and Miller-Naudé (2017, 223) describe the pronoun in a 
tripartite verbless clause as a “last resort syntactic strategy” for 
disambiguating the subject NP. 
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provide models for pursuing similar complexity approaches to 
linguistic research in biblical studies. 

Complexity Theory treats an object of study as part of 
an integrated and dynamic whole, not as an isolated or self- 
contained phenomenon. Within the domain of language and 
culture, this metatheory aims to describe the complex nature of 
linguistic phenomena as shaped by the interaction of multiple 
internal dimensions and external influences, rather than as con- 
tained within a particular autonomous analytical silo. Most cru- 
cially, a complexity approach to biblical scholarship provides the 
researcher with a model for thinking about the impact of adjacent 
and interrelated phenomena on the object of study, and for 
understanding how aspects of biblical languages and texts fit 
into larger systems of language and culture, ultimately offering 
“greater coherence in explaining what [is] already know[n]” 
(Larsen-Freeman and Cameron 2008, 11). A complexity approach, 
then, can serve as a counterbalance to the powerful scholarly 
inclination to isolate one’s object of study, encouraging the devel- 
opment of a research programme that takes into account the 


larger systems in which the phenomenon occurs. 
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GLOSSARY 


Ablaut 
Morphological vowel alternations in Indo-European languages. 
Adjunct 
An optional or omissable constituent in a predication (con- 
trast complement). 
Alveolar 
A place of articulation behind the upper teeth. 
Anaphor 
A grammatical element without independent reference, 
which gains its reference from an antecedent within the same 
structure (i.e., reflexive pronouns, reciprocal pronouns). 
Artificial Intelligence 
Techniques that enable computers to perform intellectual 
tasks normally performed by humans. 
Bilabial 
A place of articulation between the lips. 
Case Structure 
Case (capitalised) refers to the assignment of abstract gram - 
matical relations (Cases) within syntactic structure. 
Comment 
The material of a sentence that is the main point. Typically, 
it is new information. In Topic-Comment analysis, the Topic 


is the foundation on which the Comment is shared. 
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Complement 
A required constituent in a predication (e.g., direct object; 
contrast adjunct). 

Computer Linguistics 
Any form of linguistic analysis with computational methods. 

Concatenative Morphology 
The interweaving of vowel and consonant morphology in 
Semitic languages. 

Constituent Negation 
Negation with scope only over a constituent (contrast sen- 
tential negation). 

Contextualising Constituent 
Or Topic Constituent, part of a sentence that specially sig- 
nals a point of connection with the context. 

Copies 
See traces. 

Coronal 
A class of consonants articulated with the forward portion 
of the tongue. 

Corpus Linguistics 
Linguistic analysis based on text corpora. It studies the 
language as it is expressed in a corpus, be it a purposefully 
designed corpus (such as, e.g., the British National Corpus) 
or a corpus that has come down to us through tradition 
(such as the Old and New Testaments). 

Debuccalisation 


The loss of a place of articulation. 
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Deep Structure 
The underlying structure of language. 

Degemination 
The shortening of consonantal articulation. 

Deep Learning 
Extraction of patterns from data with the help of neural 
networks. A subgroup of machine learning. 

E-Language 
External language (see language performance). 

Empty Category 
A category that is not overtly present in surface structure 
but is present in the underlying (deep) structure (see also 
null constituent). 

Extraposition 
The movement of a constituent to the right (final) edge 
of the sentence (see also topicalisation and contrast right 
dislocation). 

Focus 
This term may indicate (1) the salient information of a 
sentence; (2) for generic emphasis, covering both Topic and 
Focus in Functional Grammar, including either part of a 
sentence or a whole sentence itself; or, in Functional Gram- 
mar, (3) specially marked salient information. The Focus 
may be marked by word order positions, intonation, or 
lexically. 

Functional Categories 
Categories within Universal Grammar whose members are 


items with no descriptive content, namely, D(eterminer), 
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T(ense), Asp(ect), M(ood) C (complementiser/coordinator; 
contrast lexical categories). 

Fricative 
A class of consonants articulated with a narrow opening in 
the vocal tract. 

Geminate 
The lengthening of consonantal articulation. 

Generative Linguistics 
A model of linguistic analysis developed by Noam Chomsky 
that considers grammar as a set of rules that generates gram- 
matical sentences in any given language from an innate uni- 
versal grammatical structure. 

Glottal 
A class of consonants articulated with the glottis. 

Grammaticalisation 
The process by which a lexical item loses its semantic con- 
tent and develops a grammatical function. 

Heavy Extraposition 
The movement of a constituent to the right (final) edge of 
construction across a sentence boundary. 

Heavy Topicalisation 
The movement of a constituent to the left (initial) edge of 
a construction across a sentence boundary. 

I-Language 
Internal language (see language competence). 

Information Structure 
The way a message is presented to an audience so that new 


information is built upon presumably shared information, 
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which may be explicit or implicit. The basic terms of infor- 
mation packaging are part of Topic-Comment analysis. 
Interdental 
A place of articulation between the teeth and the tongue. 
Intermediate Phrase 
The intermediate phrase is the minor phrase domain of the 
prosodic hierarchy above the prosodic word. The intermedi- 
ate phrase organises words into a cohesive prosodic unit. It is 
the main prosodic domain of the syntax-phonology interface. 
Intonational Phrase 
The intonational phrase is the major phrase domain of the 
prosodic hierarchy above the intermediate phrase. The into- 
national phrase is primarily associated with post-lexical 
meaning (viz. meaning above the word). 
Isogloss 
A linguistic innovation linking two or more speech 
communities. 
Language Competence 
The finite internal language knowledge of speakers that 
allows them to produce an infinite number of sentences and 
their meanings. 
Language Diffusion 
A diachronic process by which a dominant language 
spreads. 
Language Typology 
A cross-linguistic comparison and classification of lan- 


guages and their components according to shared features. 
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Language Performance 
An individual speakers’ use of language knowledge in acts 
of communication, namely, speech production and speech 
perception. 

Laryngeal 
A class of sounds hypothesised for Proto-Indo-European 
with direct reflexes in Hittite. 

Last Resort 
The principle that a grammatical operation applies only 
when there is no other means of satisfying a grammatical 
requirement. 

Left Dislocation 
A constituent that occurs before the left (initial) edge of the 
sentence and is represented within the sentence with a 
resumptive element (see also right dislocation and contrast 
topicalisation). 

Lexical Categories 
Categories within Universal Grammar whose members are 
items with descriptive content, namely, N(oun), V(erb), 
A (adjective/adverb), and P(reposition; contrast functional 
categories). 

Linked Data 
A method for publishing structured data using vocabularies 
that allow for interlinking with other data and useful 
semantic queries. 

Logical Form 
The underlying semantic representation of grammatical 
structure. 
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Machine Learning 
Algorithms that enable computers to learn without being 
explicitly programmed; a subgroup of Artificial Intelligence. 
Merge 
Movement of elements within the Minimalist Programme. 
Morpheme 
The minimal discrete unit of meaning in language. 
Move a 
Movement of elements within Principles-and-Parameters. 
Natural Language Processing 
The design and analysis of algorithms for processing and 
representing natural human language. 
Null Constituent 
A clause constituent that is present in the deep structure 
but not realised in the surface structure (also called zero 
constituent; see also empty category). 
Null Head 
The head of a phrase that is present in the deep structure 
but not realised in the surface structure. 
Null Subject Parameter 
A parameter present in some languages allowing a finite 
verb to occur without an overt subject in surface structure 
(also called pro-drop parameter). 
Open Science 
A movement that aims at more open scholarly practices, in 
which research data, software and publications are made 


available for reuse. 
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Palatal 
A class of consonants articulated with the palate. 
Parameter 
A language feature that may have different ‘settings’ in dif- 
ferent languages (e.g., the pro-drop parameter is present in 
some languages and absent in other languages). 
Pharyngeal 
A class of consonants articulated with the pharynx. 
Phone 
Any distinct sound unit. 
Phoneme 
The minimal sound unit; a building block for morphemes. 
Phonological Form 
The final, phonological shape of language at surface structure. 
Pitch Accents 
Pitch fluctuations in the flow of speech that are associated 
with accented syllables. 
Pro-Drop Parameter 
See null subject parameter. 
Prosodic Hierarchy 
The prosodic hierarchy is an abstract structure for con- 
ceptualising the phonological organisation of constituent 
domains within the flow of speech (e.g., higher-level con- 
stituents include the prosodic word, intermediate phrase, 
intonational phrase, and utterance). Since prosodic domains 
are organised from syntactic structures, the prosodic hier- 


archy mediates an indirect syntax-phonology interface. 
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Prosodic Orthography 
A conventionalised writing system of the phrase structure 
and key phonological and phonetic features (e.g., intona- 
tion, stress, segmentation) of a prosodic system. 

Prosodic Phonology 
A subfield of linguistics that provides the cross-linguistic 
theoretical framework for understanding the complex sys- 
tems of sounds and interfaces that govern the organisation 
of speech. 

Prosodic Phrase Structure 
The grouping of words in an utterance, characterised by 
smaller units embedded in larger ones. 

Prosodic System 
A complex system (viz. interfacing with other dimensions 
of language including phonology, syntax, and discourse- 
pragmatics) that regulates the way spoken language is real - 
ised. The phonetic features of a prosodic system are pitch, 
stress, segmentation, and voice quality. 

Prosodic Word 
Any word in an utterance that bears stress. In the prosodic 
orthography of Tiberian Hebrew (viz. the Masoretic accen- 
tual system), prosodic words are marked by either conjunc- 
tive or disjunctive graphemes. 

Proto-Indo-European 
The reconstructed ancestor of the Indo-European family of 
languages. 

Proto-Language 


A reconstructed common ancestor of a family of languages. 
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Right Dislocation 
A constituent that occurs beyond the right (end) edge of the 
sentence and is represented within the sentence with a 
resumptive element (see also left dislocation and contrast 
extraposition). 

Role and Reference Grammar 
A model of linguistic analysis developed by William A. 
Foley and Robert D. Van Valin, in which language is 
conceived as a system of communicative social action and 
which focuses on the communicative functions of gram- 
matical structures. 

Sentential Negation 
Negation with scope over the entire sentence. 

Sibilant 
A class of fricatives with acoustically prominent amplitude 
and pitch. 

Spirantisation 
The change from a stop articulation to a fricative. 

Structuralism 
A model of linguistic analysis developed building on the 
works of Ferdinand de Saussure, in which language is 
conceived as a self-contained system whose elements are 
defined by their relationship to other elements within the 
system. 

Stop 
A class of consonants produced by a momentary obstruc- 
tion of airflow in the vocal tract. 
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Surface Structure 
The overt realisation of language structure. 
Supervised Learning 
A form of machine learning in which the machine learns 
from human-labelled examples. 
Suppletive Morphology 
A paradigm with phonemically unrelated morphemes. 
Suprasegmental 
Phonetic features that map across multiple segments. 
Syntax-Phonology Interface 
The syntax-phonology interface refers to the interaction of 
syntactic constituents with phonological constituents. It is 
primarily observed in the phonological rules that reference 
syntactic constituents as the domains for their application. 
Tagmemics 
A model of linguistic analysis developed by Kenneth Pike, 
which takes the tagmeme as the basic unity of grammatical 
description, a tagmeme being defined as the correlation of 
a grammatical slot and the class of words by which it can 
be filled. 
Topicalisation 
The movement of a constituent to the left (initial) edge 
of the sentence (see also extraposition and contrast left 
dislocation). 
Theme/Rheme 


Alternate names for Topic and Comment, respectively. 
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Thetic 
Sentences where the whole sentence is presented as Com- 
ment material, without shared orientating information. 
May answer an implied question, ‘What happened?’ 

Topic 
This term may indicate (1) the subject matter of a text; (2) 
information assumed to be shared and that becomes the 
introduction to material in a sentence that is the main mes- 
sage, called the Comment; or, in Functional Grammar, (3) 
a pragmatic function that can activate word order place- 
ment rules. 

Traces 
The original positions of moved elements in the underlying 
structure. 

Transformational Grammar 
Part of the theory of generative linguistics, that refers to 
the transformations that are used to produce sentences. 

Triphthong 
A sequence of vowels flanking an intervocalic glide in 
Semitic languages. 

Universal Grammar 
Part of the theory of generative linguistics that refers to the 
hypothesised common properties of language shared by all 
natural languates. 

Unsupervised Learning 
A form of machine learning in which the machine has to 


detect patterns in the unlabelled data by itself. 


336 Linguistic Theory and the Biblical Text 


Velar 
The place of articulation between the velum and tongue 
body. 
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